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TITLE OF INVENTION 

CHLAMYDIA ANTIGENS AND CORRESPONDING DNA FRAGMENTS AND USES 

THEREOF 
FIELD OF INVENTION 
The present invention relates to Chlamydia antigens and corresponding DNA 
molecules, which can be used in methods to prevent and treat Chlamydia infection in mammals, 
such as humans. 

BACKGROUND OF THE INVENTION 

Chlamydiae are prokaryotes. They exhibit morphologic and structural similarities to 
gram-negative bacteria including a trilaminar outer membrane, which contains 
lipopolysaccharide and several membrane proteins that are structurally and functionally 
analogous to proteins found in E coli. They are obligate intracellular parasites with a unique 
biphasic life cycle consisting of a metabolically inactive but infectious extracellular stage and a 
replicating but non-infectious intracellular stage. The replicative stage of the life-cycle takes 
place within a membrane-bound inclusion which sequesters the bacteria away from the 
cytoplasm of the infected host cell. 

C pneumoniae is a common human pathogen, originally described as the TWAR strain 
of Chlamydia psittaci but subsequently recognised to be a new species. C. pneumoniae is 
antigenically, genetically and morphologically distinct from other chlamydia species (C 
trachomatis, C. pecorum and C psittaci). It shows 10% or less DNA sequence homology with 
either of ^trachomatis or Cpsittaci and so far appears to consist of only a single strain, TWAR. 

C pneumoniae is a common cause of community acquired pneumonia, only less frequent 
than Streptococcus pneumoniae and Mycoplasma pneumoniae (Ref 1,2). It can also cause upper 
respiratory tract symptoms and disease, including bronchitis and sinusitis (Ref 1,3,4,5). The 
great majority of the adult population (over 60%) has antibodies to C pneumoniae (Ref 5), 
indicating past infection which was unrecognized or asymptomatic. 

Of considerable importance is the association of atherosclerosis and C. pneumoniae 
infection. There are several epidemiological studies showing a correlation of previous infections 
with C. pneumoniae and heart attacks, coronary artery and carotid artery disease (Ref 6-10). 
Moreover, the organisms has been detected in atheromas and fatty streaks of the coronary, 
carotid, peripheral arteries and aorta (Ref 11-15). Viable C pneumoniae has been recovered 
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from the coronary and carotid artery (Ref 16,17). Furthermore, it has been shown that C. 
pneumoniae can induce changes of atherosclerosis in a rabbit model (Ref 18). Taken together, 
these results indicate that it is highly probable that C. pneumoniae can cause atherosclerosis in 
humans, though the epidemiological importance of chlamydial atherosclerosis remains to be 
5 demonstrated. 

A number of recent studies have also indicated an association between C pneumoniae 
infection and asthma. Infection has been linked to wheezing, asthmatic bronchitis, adult-onset 
asthma and acute exacerbations of asthma in adults, and small-scale studies have shown that 
prolonged antibiotic treatment was effective at greatly reducing the severity of the disease in 
1 0 some individuals (Ref 1 9-24). 

In light of these results a protective vaccine against G pneumoniae infection would be of 
considerable importance. There is not yet an effective vaccine for any human chlamydial 
infection. Nevertheless, studies with G trachomatis and C psittaci indicate that this is an 
attainable goal. For example, mice which have recovered from a lung infection with G 
15 trachomatis are protected from infertility induced by a subsequent vaginal challenge (Ref 25). 
Similarly, sheep immunized with inactivated G psittaci were protected from subsequent 
chlamydial-induced abortions and stillbirths (Ref 26). Protection from chlamydial infections has 
been associated with Thl immune responses, particularly the induction of INFg - producing 
2 CD4+T-celIs (Ref 27). The adoptive transfer of CD4+ cell lines or clones to nude or SCID mice 
5 20 conferred protection from challenge or cleared chronic disease (Ref 28,29), and in vivo 
% depletion of CD4+ T cells exacerbated disease post-challenge (Ref 30,31). However, the 
presence of sufficiently high titres of neutralising antibody at mucosal surfaces can also exert a 
protective effect (Ref 32). 

The extent of antigenic variation within the species G pneumoniae is not well 
25 characterised. Serovars of C trachomatis are defined on the basis of antigenic variation in 
MOMP, but published C pneumoniae MOMP gene sequences show no variation between 
several diverse isolates of the organism (Ref 33-35). Regions of the protein known to be 
conserved in other chlamydial MOMPs are conserved in C pneumoniae (Ref 33,34). One study 
has described a strain of C pneumoniae with a MOMP of greater that usual molecular weight, 
30 but the gene for this has not been sequenced (Ref 1). Partial sequences of outer membrane 
protein 2 from nine diverse isolates were also found to be invariant (Ref 16). The genes for 
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HSP60 and HSP70 show little variation from other chlamydial species, as would be expected. 
The gene encoding a 76kDa antigen has been cloned from a single strain of C. pneumoniae. It 
has no significant similarity with other known chlamydial genes (Ref 4). 

Many antigens recognised by immune sera to C. pneumoniae are conserved across all 
chlamydiae, but 98kDa, 76 kDa and 54 kDa proteins may be C. pneumoniae-specific (Ref 2, 4, 
36). Immunoblotting of isolates with sera from patients does show variation of blotting patterns 
between isolates, indicating that serotypes C. pneumoniae may exist (Ref 1,16). However, the 
results are potentially confounded by the infection status of the patients, since immunoblot 
profiles of a patient's sera change with time post-infection. An assessment of the number and 
relative frequency of any serotypes, and the defining antigens, is not yet possible. 

C. pneumoniae infection usually presents as an acute respiratory disease (i.e., cough, sore 
throat, hoarseness, and fever, abnormal chest sounds on auscultation). For most patients, the 
cough persists for 2 to 6 weeks, and recovery is slow. In approximately 10% of these cases, 
upper respiratory tract infection is followed by bronchitis or pneumonia. Furthermore, during a 
C. pneumoniae epidemic, subsequent co-infection with pneumococcus has been noted in about 
half of these pneumonia patients, particularly in the infirm and the elderly. As noted above, 
there is more and more evidence that C. pneumoniae infection is also linked to diseases other 
than respiratory infections. 

The reservoir for the organism is presumably people. In contrast to C. psittaci infections, 
there is no known bird or animal reservoir. Transmission has not been clearly defined. It may 
result from direct contact with secretions, from formites, or from airborne spread. There is a long 
incubation period, which may last for many months. Based on analysis of epidemics, C. 
pneumoniae appears to spread slowly through a population (case-to-case interval averaging 30 
days) because infected persons are inefficient transmitters of the organism. Susceptibility to C. 
pneumoniae is universal. Reinfections occur during adulthood, following the primary infection as 
a child. C. pneumoniae appears to be an endemic disease throughout the world, noteworthy for 
superimposed intervals of increased incidence (epidemics) that persist for 2 to 3 years. 
Ctrachomatis infection does not confer cross-immunity to C. pneumoniae. Infections are easily 
treated with oral antibiotics, tetracycline or erythromycin (2 g/d, for at least 10 to 14 d). A 
recently developed drug, azithromycin, is highly effective as a single-dose therapy against 
chlamydial infections. 
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In most instances, C. pneumoniae infection is often mild and without complications, and 
up to 90% of infections are subacute or unrecognized. Among children in industrialized 
countries, infections Jiave been thought to be rare up to the age of 5 y, although a recent study (E 
Normann et al, Chlamydia pneumoniae in children with acute respiratory tract infections, Acta 
Paediatrica, 1998, Vol 87, Iss 1, pp 23-27) has reported that many children in this age group 
show PCR evidence of infection despite being seronegative, and estimates a prevalence of 17- 
19% in 2-4 y olds. In developing countries, the seroprevalence of C. pneumoniae antibodies 
among young children is elevated, and there are suspicions that C, pneumoniae may be an 
important cause of acute lower respiratory tract disease and mortality for infants and children in 
tropical regions of the world. 

From seroprevalence studies and studies of local epidemics, the initial C pneumoniae 
infection usually happens between the ages of 5 and 20 y. In the USA, for example, there are 
estimated to be 30,000 cases of childhood pneumonia each year caused by G pneumoniae. 
Infections may cluster among groups of children or young adults (e.g., school pupils or military 
conscripts). 

C. pneumoniae causes 10 to 25% of community-acquired lower respiratory tract infections 
(as reported from Sweden, Italy, Finland, and the USA). During an epidemic, C. pneumonia 
infection may account for 50 to 60% of the cases of pneumonia. During these periods, also, more 
episodes of mixed infections with S. pneumoniae have been reported. 

Reinfection during adulthood is common; the clinical presentation tends to be milder. Based on 
population seroprevalence studies, there tends to be increased exposure with age, which is 
particularly evident among men. Some investigators have speculated that a persistent, 
asymptomatic C pneumoniae infection state is common. 

In adults of middle age or older, C pneumoniae infection may progress to chronic 
bronchitis and sinusitis. A study in the USA revealed that the incidence of pneumonia caused by 
C. pneumoniae in persons younger than 60 years is 1 case per 1,000 persons per year; but in the 
elderly, the disease incidence rose three-fold. C, pneumoniae infection rarely leads to 
hospitalization, except in patients with an underlying illness. 

SUMMARY OF THE INVENTION 
The present invention provides purified and isolated DNA molecules that encode 
Chlamydia polypeptides designated CPN100638 (SEQ ID No: 1,2), which can be used in 
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methods to prevent, treat, and diagnose Chlamydia infection. The encoded polypeptides include 
polypeptides having the amino acid sequence shown in SEQ ID No:3. Those skilled in the art 
will appreciate that the invention also includes DNA molecules that encode mutants and 
derivatives of such polypeptides, which result from the addition, deletion, or substitution of non- 
essential amino acids as described herein. The invention also includes RNA molecules 
corresponding to the DNA molecules of the invention. 

In addition to the DNA and RNA molecules, the invention includes the corresponding 
polypeptides and monospecific antibodies that specifically bind to such polypeptides. 

The present invention has wide application and includes expression cassettes, vectors, 
and cells transformed or transfected with the polynucleotides of the invention. Accordingly, the 
present invention provides (i) a method for producing a polypeptide of the invention in a 
recombinant host system and related expression cassettes, vectors, and transformed or transfected 
cells; (ii) a live vaccine vector, such as a pox virus, Salmonella typhimurium, or Vibrio cholerae 
vector, containing a polynucleotide of the invention, such vaccine vectors being useful for, e.g., 
preventing and treating Chlamydia infection, in combination with a diluent or carrier, and related 
pharmaceutical compositions and associated therapeutic and/or prophylactic methods; (Hi) a 
therapeutic and/or prophylactic method involving administration of an RNA or DNA molecule of 
the invention, either in a naked form or formulated with a delivery vehicle, a polypeptide or 
combination of polypeptides, or a monospecific antibody of the invention, and related 
pharmaceutical compositions; (iv) a method for diagnosing the presence of Chlamydia in a 
biological sample, which can involve the use of a DNA or RNA molecule, a monospecific 
antibody, or a polypeptide of the invention; and (v) a method for purifying a polypeptide of the 
invention by antibody-based affinity chromatography. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The present invention will be further understood from the following description with 

reference to the drawings, in which: 

Figure 1 shows the nucleotide sequence of the CPN1 00638 (SEQ ID No: 1 - entire 
sequence and SEQ ID No: 2 - coding sequence) and the deduced amino acid sequence of the 
CPN1 00638 protein from Chlamydia pneumoniae (SEQ ID No: 3). The sequence is encoded on 
the negative strand. 
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Figure 2 shows the restriction enzyme analysis of the gene encoding the C. 
pneumoniae CPN100638 gene. 

DETAILED DESCRIPTION OF INVENTION 

In the C. pneumoniae genome, open reading frames (ORFs) encoding chlamydial 
polypeptides have been identified. These polypeptides include polypeptides permanently found 
in the bacterial membrane structure, polypeptides that are present in the external vicinity of the 
bacterial membrane, include polypeptides permanently found in the inclusion membrane 
structure, polypeptides that are present in the external vicinity of the inclusion membrane, and 
polypeptides that are released into the cytoplasm of the infected cell. These polypeptides can be 
used in vaccination methods for preventing and treating Chlamydia infection. 

According to a first aspect of the invention, there are provided isolated 
polynucleotides encoding the precursor and mature forms of Chlamydia polypeptides. 

An isolated polynucleotide of the invention encodes (i) a polypeptide having an 
amino acid sequence that is homologous to a Chlamydia amino acid, the Chlamydia amino acid 
sequence being selected from the group consisting of: 

(a) the amino acid sequences as shown: (SEQ ID No: 3 ) 

The term "isolated polynucleotide" is defined as a polynucleotide removed from the 
environment in which it naturally occurs. For example, a naturally-occurring DNA molecule 
present in the genome of a living bacteria or as part of a gene bank is not isolated, but the same 
molecule separated from the remaining part of the bacterial genome, as a result of, e.g. 9 a cloning 
event (amplification), is isolated. Typically, an isolated DNA molecule is free from DNA 
regions (e.g., coding regions) with which it is immediately contiguous at the 5' or 3' end, in the 
naturally occurring genome. Such isolated polynucleotides could be part of a vector or a 
composition and still be isolated in that such a vector or composition is not part of its natural 
environment. 

A polynucleotide of the invention can be in the form of RNA or DNA (e.g., cDNA, 
genomic DNA, or synthetic DNA), or modifications or combinations thereof. The DNA can be 
double-stranded or single-stranded, and, if single-stranded, can be the coding strand or the non- 
coding (anti-sense) strand. The sequence that encodes a polypeptide of the invention as shown in 
SEQ ID NOs: 1 and 2, can be (a) the coding sequence as shown in SEQ ID NOs:2 (b) a 
ribonucleotide sequence derived by transcription of (a) ; or (c) a different coding sequence; this 
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latter, as a result of the redundancy or degeneracy of the genetic code, encodes the same 
polypeptides as the DNA molecules of which the nucleotide sequences are illustrated in SEQ ID 
NOs:l to 2. 

By "polypeptide" or "protein" is meant any chain of amino acids, regardless of length 
or post-translational modification (e.g., glycosylation or phosphorylation). Both terms are used 
interchangeably in the present application. 

By "homologous amino acid sequence" is meant an amino acid sequence that differs 
from an amino acid sequence shown in SEQ ID No: 3 , only by one or more conservative amino 
acid substitutions, or by one or more non-conservative amino acid substitutions, deletions, or 
additions located at positions at which they do not destroy the specific antigenicity of the 
polypeptide. 

Preferably, such a sequence is at least 75%, more preferably 80%, and most 
preferably 90% identical to an amino acid sequence shown in SEQ ID No: 3 . 

Homologous amino acid sequences include sequences that are identical or 
substantially identical to an amino acid sequence as shown in SEQ ED No:3. By "amino acid 
sequence substantially identical" is meant a sequence that is at least 90%, preferably 95%, more 
preferably 97%, and most preferably 99% identical to an amino acid sequence of reference and 
that preferably differs from the sequence of reference, if at all, by a majority of conservative 
amino acid substitutions. 

Conservative amino acid substitutions typically include substitutions among amino 
acids of the same class. These classes include, for example, amino acids having uncharged polar 
side chains, such as asparagine, glutamine, serine, threonine, and tyrosine; amino acids having 
basic side chains, such as lysine, arginine, and histidine; amino acids having acidic side chains, 
such as aspartic acid and glutamic acid; and amino acids having nonpoiar side chains, such as 
glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, and 
cysteine. 

Homology is typically measured using sequence analysis software (e.g., Sequence 
Analysis Software Package of the Genetics Computer Group, University of Wisconsin 
Biotechnology Center, 1710 University Avenue, Madison, WI 53705). Similar amino acid 
sequences are aligned to obtain the maximum degree of homology (i.e., identity). To this end, it 
may be necessary to artificially introduce gaps into the sequence. Once the optimal alignment 
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has been set up, the degree of homology (f.e., identity) is established by recording all of the 
positions in which the amino acids of both sequences are identical, relative to the total number of 
positions. 

Homologous polynucleotide sequences are defined in a similar way. Preferably, a 
homologous sequence is one that is at least 45%, more preferably 60%, and most preferably 85% 
identical to (i) a coding sequence of SEQ ID NOs:l and 2. 

Polypeptides having a sequence homologous to one of the sequences shown in SEQ 
ID NO: 3 , include naturally-occurring allelic variants, as well as mutants or any other non- 
naturally occurring variants that are analogous in terms of antigenicity, to a polypeptide having a 
sequence as shown in SEQ ID NO: 3 . 

As is known in the art, an allelic variant is an alternate form of a polypeptide that is 
characterized as having a substitution, deletion, or addition of one or more amino acids that does 

not alter the biological function of the polypeptide. By "biological function" is meant the 

i 

function of the polypeptide in the cells in which it naturally occurs, even if the function is not 
necessary for the growth or survival of the cells. For example, the biological function of a porin 
is to allow the entry into cells of compounds present in the extracellular medium. The biological 
function is distinct from the antigenic function. A polypeptide can have more than one 
biological function. 

Allelic variants are very common in nature. For example, a bacterial species, e.g. 9 C. 
pneumoniae, is usually represented by a variety of strains that differ from each other by minor 
allelic variations. Indeed, a polypeptide that fulfills the same biological function in different 
strains can have an amino acid sequence that is not identical in each of the strains. Such an 
allelic variation may be equally reflected at the polynucleotide level. 

Support for the use of allelic variants of polypeptide antigens comes from, e.g., 
studies of the Chlamydial MOMP antigen. The amino acid sequence of the MOMP varies from 
strain to strain, yet cross-strain antibody binding plus neutralization of infectivity occurs, 
indicating that the MOMP, when used as an immunogen, is tolerant of amino acid variations. 

Polynucleotides, e.g. 9 DNA molecules, encoding allelic variants can easily be 
retrieved by polymerase chain reaction (PCR) amplification of genomic bacterial DNA extracted 
by conventional methods. This involves the use of synthetic oligonucleotide primers matching 
upstream and downstream of the 5' and 3' ends of the encoding domain. Suitable primers can be 

8 
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designed according to the nucleotide sequence information provided in SEQ ID NOs:l and 2. 

Typically, a primer can consist of 10 to 40, preferably 15 to 25 nucleotides. It may be also 

advantageous to select primers containing C and G nucleotides in a proportion sufficient to 

ensure efficient hybridization; e.g., an amount of C and G nucleotides of at least 40%, preferably 

50% of the total nucleotide amount. 

Useful homologs that do not naturally occur can be designed using known methods 

for identifying regions of an antigen that are likely to be tolerant of amino acid sequence changes 

and/or deletions. For example, sequences of the antigen from different species can be compared 

to identify conserved sequences. 

Polypeptide derivatives that are encoded by polynucleotides of the invention include, 

e.g., fragments, polypeptides having large internal deletions derived from full-length 

polypeptides, and fusion proteins. 

Polypeptide fragments of the invention can be derived from a polypeptide having a 
sequence homologous to any of the sequences shown in SEQ ID NO: 3 , to the extent that the 
fragments retain the substantial antigenicity of the parent polypeptide (specific antigenicity). 
Polypeptide derivatives can also be constructed by large internal deletions that remove a 
substantial part of the parent polypeptide, while retaining specific antigenicity. Generally, 
polypeptide derivatives should be about at least 12 amino acids in length to maintain 
antigenicity. Advantageously, they can be at least 20 amino acids, preferably at least 50 amino 
acids, more preferably at least 75 amino acids, and most preferably at least 100 amino acids in 
length. 

Useful polypeptide derivatives, e.g., polypeptide fragments, can be designed using 
computer-assisted analysis of amino acid sequences in order to identify sites in protein antigens 
having potential as surface-exposed, antigenic regions (Ref 37). 

Polypeptide fragments and polypeptides having large internal deletions can be used 
for revealing epitopes that are otherwise masked in the parent polypeptide and that may be of 
importance for inducing a protective T cell-dependent immune response. Deletions can also 
remove immunodominant regions of high variability among strains. 

It is an accepted practice in the field of immunology to use fragments and variants of 
protein immunogens as vaccines, as all that is required to induce an immune response to a 
protein is a small (e.g., 8 to 10 amino acid) immunogenic region of the protein. This has been 
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done for a number of vaccines against pathogens other than Chlamydia. For example, short 
synthetic peptides corresponding to surface-exposed antigens of pathogens such as murine 
mammary tumor virus, peptide containing 11 amino acids; (Ref 38), Semliki Forest virus, 
peptide containing 16 amino acids (Ref 39), and canine parvovirus, 2 overlapping peptides, each 
containing 15 amino acids (Ref 40), have been shown to be effective vaccine antigens against 
their respective pathogens. 

Polynucleotides encoding polypeptide fragments and polypeptides having large 
internal deletions can be constructed using standard methods (Ref 41), for example, by PCR, 
including inverse PCR, by restriction enzyme treatment of the cloned DNA molecules, or by the 
method of Kunkel et al. (Ref 42) biological material available at Stratagene. 

A polypeptide derivative can also be produced as a fusion polypeptide that contains a 
polypeptide or a polypeptide derivative of the invention fused, e.g., at the N- or C-terminal end, 
to any other polypeptide (hereinafter referred to as a peptide tail). Such a product can be easily 
obtained by translation of a genetic fusion, z.e., a hybrid gene. Vectors for expressing fusion 
polypeptides are commercially available, such as the pMal-c2 or pMal-p2 systems of New 
England Biolabs, in which the peptide tail is a maltose binding protein, the glutathione-S- 
transferase system of Pharmacia, or the His-Tag system available from Novagen. These and 
other expression systems provide convenient means for further purification of polypeptides and 
derivatives of the invention. 

Another particular example of fusion polypeptides included in invention includes a 
polypeptide or polypeptide derivative of the invention fused to a polypeptide having adjuvant 
activity, such as, e.g., subunit B of either cholera toxin or E. coli heat-labile toxin. Several 
possibilities are can be used for achieving fusion. First, the polypeptide of the invention can be 
fused to the N-, or preferably, to the C-terminal end of the polypeptide having adjuvant activity. 
Second, a polypeptide fragment of the invention can be fused within the amino acid sequence of 
the polypeptide having adjuvant activity. 

As stated above, the polynucleotides of the invention encode Chlamydia polypeptides 
in precursor or mature form. They can also encode hybrid precursors containing heterologous 
signal peptides, which can mature into polypeptides of the invention. By "heterologous signal 
peptide" is meant a signal peptide that is not found in the naturdly-occurring precursor of a 
polypeptide of the invention. 

10 
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A polynucleotide of the invention, having a homologous coding sequence, hybridizes, 
preferably under stringent conditions, to a polynucleotide having a sequence as shown in SEQ ID 
NOstl to 2. Hybridization procedures are, e.g., described in Ausubel et aL 9 (Ref 41), Silhavy et 
aL (Ref 43); Davis et al (ref 44). Important parameters that can be considered for optimizing 
hybridization conditions are reflected in a formula that allows calculation of a critical value, the 
melting temperature above which two complementary DNA strands separate from each other Ref 
45). This formula is as follows: Tm = 81.5 + 0.5 x (% G+C) + 1.6 log (positive ion 
concentration) - 0.6 x (% formamide). Under appropriate stringency conditions, hybridization 
temperature (Th) is approximately 20 to 40°C, 20 to 25°C, or, preferably 30 to 40°C below the 
calculated Tm. Those skilled in the art will understand that optimal temperature and salt 
conditions can be readily determined empirically in preliminary experiments using conventional 
procedures. 

For example, stringent conditions can be achieved, both for pre-hybridizing and 
hybridizing incubations, (i) within 4-16 hours at 42°C, in 6 x SSC containing 50% formamide or 
(ii) within 4-16 hours at 65°C in an aqueous 6 x SSC solution (1 M NaCl, 0.1 M sodium citrate 
(PH7.0)). 

For polynucleotides containing 30 to 600 nucleotides, the above formula is used and 
then is corrected by subtracting (600/polynucleotide size in base pairs). Stringency conditions 
are defined by a Th that is 5 to 10°C below Tm. 

Hybridization conditions with oligonucleotides shorter than 20-30 bases do not 
exactly follow the rules set forth above. In such cases, the formula for calculating the Tm is as 
follows: Tm = 4 x (G+C) + 2 (A+T). For example, an 18 nucleotide fragment of 50% G+C 
would have an approximate Tm of 54°C. 

A polynucleotide molecule of the invention, containing RNA, DNA, or modifications 
or combinations thereof, can have various applications. For example, a DNA molecule can be 
used (i) in a process for producing the encoded polypeptide in a recombinant host system, (ii) in 
the construction of vaccine vectors such as poxviruses, which are further used in methods and 
compositions for preventing and/or treating Chlamydia infection, (iii) as a vaccine agent (as well 
as an RNA molecule), in a naked form or formulated with a delivery vehicle and, (iv) in the 
construction of attenuated Chlamydia strains that can over-express a polynucleotide of the 
invention or express it in a non-toxic, mutated form. 

11 
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According to a second aspect of the invention, there is therefore provided (i) an 
expression cassette containing a DNA molecule of the invention placed under the control of the 
elements required for expression, in particular under the control of an appropriate promoter; (ii) 
an expression vector containing an expression cassette of the invention; (iii) a procaryotic or 
5 eucaryotic cell transformed or transfected with an expression cassette and/or vector of the 
invention, as well as (iv) a process for producing a polypeptide or polypeptide derivative 
encoded by a polynucleotide of the invention, which involves culturing a procaryotic or 
eucaryotic cell transformed or transfected with an expression cassette and/or vector of the 
invention, under conditions that allow expression of the DNA molecule of the invention and, 
10 recovering the encoded polypeptide or polypeptide derivative from the cell culture. 

A recombinant expression system can be selected from procaryotic and eucaryotic 
hosts. Eucaryotic hosts include yeast cells {e.g., Saccharomyces cerevisiae or Pichia pastoris), 
m mammalian cells (eg., COS1, NIH3T3, or JEG3 cells), arthropods cells (e.g., Spodoptera 
frugiperda (SF9) cells), and plant cells. Preferably, a procaryotic host such as E. coli is used. 
Bacterial and eucaryotic cells are available from a number of different sources to those skilled in 
W the art, e.g., the American Type Culture Collection (ATCC; Rockville, Maryland). 

3 1 1 

;S The choice of the expression system depends on the features desired for the expressed 

f B polypeptide. For example, it may be useful to produce a polypeptide of the invention in a 

FU particular lipidated form or any other form. 

p 20 The choice of the expression cassette will depend on the host system selected as well 

as the features desired for the expressed polypeptide. Typically, an expression cassette includes 
a promoter that is functional in the selected host system and can be constitutive or inducible; a 
ribosome binding site; a start codon (ATG) if necessary, a region encoding a signal peptide, e.g., 
a lipidation signal peptide; a DNA molecule of the invention; a stop codon; and optionally a 3' 
25 terminal region (translation and/or transcription terminator). The signal peptide encoding region 
is adjacent to the polynucleotide of the invention and placed in proper reading frame. The signal 
peptide-encoding region can be homologous or heterologous to the DNA molecule encoding the 
mature polypeptide and can be specific to the secretion apparatus of the host used for expression. 
The open reading frame constituted by the DNA molecule of the invention, solely or together 
30 with the signal peptide, is placed under the control of the promoter so that transcription and 
translation occur in the host system. Promoters, signal peptide encoding regions are widely 



12 




RY51 111998 



known and available to those skilled in the art and includes, for example, the promoter of 
Salmonella typhimurium (and derivatives) that is inducible by arabinose (promoter araB) and is 
functional in Gram-negative bacteria such as E. coll (as described in U.S. Patent No. 5,028,530 
and in Cagnon et al, (Ref 46); the promoter of the gene of bacteriophage T7 encoding RNA 
polymerase, that is functional in a number of E. coli strains expressing T7 polymerase (described 
in U.S. Patent No. 4,952,496); OspA lipidation signal peptide ; and RlpB lipidation signal 
peptide (Ref 47). 

The expression cassette is typically part of an expression vector, which is selected for 
its ability to replicate in the chosen expression system. Expression vectors (e.g., plasmids or 
viral vectors) can be chosen from those described in Pouwels et aL (Cloning Vectors: A 
Laboratory Manual 1985, Supp. 1987). They can be purchased from various commercial 
sources. 

Methods for transforming/transfecting host cells with expression vectors will depend 
on the host system selected as described in Ausubel et al., (Ref 41). 

Upon expression, a recombinant polypeptide of the invention (or a polypeptide 
derivative) is produced and remains in the intracellular compartment, is secreted/excreted in the 
extracellular medium or in the periplasmic space, or is embedded in the cellular membrane. The 
polypeptide can then be recovered in a substantially purified form from the cell extract or from 
the supernatant after centrifiigation of the recombinant cell culture. Typically, the recombinant 
polypeptide can be purified by antibody-based affinity purification or by any other method that 
can be readily adapted by a person skilled in the art, such as by genetic fusion to a small affinity 
binding domain. Antibody-based affinity purification methods are also available for purifying a 
polypeptide of the invention extracted from a Chlamydia strain. Antibodies useful for purifying 
by immunoaffinity the polypeptides of the invention can be obtained as described below. 

A polynucleotide of the invention can also be useful in the vaccine field, e.g., for 
achieving DNA vaccination. There are two major possibilities, either using a viral or bacterial 
host as gene delivery vehicle (live vaccine vector) or administering the gene in a free form, e.g., 
inserted into a plasmid. Therapeutic or prophylactic efficacy of a polynucleotide of the invention 
can be evaluated as described below. 

Accordingly, in a third aspect of the invention, there is provided (i) a vaccine vector 
such as a poxvirus, containing a DNA molecule of the invention, placed under the control of 
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elements required for expression; (ii) a composition of matter containing a vaccine vector of the 
invention, together with a diluent or carrier; particularly, (iii) a pharmaceutical composition 
containing a therapeutically or prophylactically effective amount of a vaccine vector of the 
invention; (iv) a method for inducing an immune response against Chlamydia in a mammal (e.g., 
a human; alternatively, the method can be used in veterinary applications for treating or 
preventing Chlamydia infection of animals, e.g., cats or birds), which involves administering to 
the mammal an immunogenically effective amount of a vaccine vector of the invention to elicit 
an immune response, e.g., a protective or therapeutic immune response to Chlamydia ; and 
particularly, (v) a method for preventing and/or treating a Chlamydia (e.g., C. trachomatis, C. 
psittaci, C pneumonia, C. pecorum) infection, which involves administering a prophylactic or 
therapeutic amount of a vaccine vector of the invention to an individual in need* Additionally, 
the third aspect of the invention encompasses the use of a vaccine vector of the invention in the 

preparation of a medicament for preventing and/or treating Chlamydia infection. 

l 

A vaccine vector of the invention can express one or several polypeptides or 
derivatives of the invention, as well as at least one additional Chlamydia antigen, fragment, 
homolog, mutant, or derivative thereof. In addition, it can express a cytokine, such as 
interleukin-2 (IL-2) or interleukin-12 (IL-12), that enhances the immune response (adjuvant 
effect). Thus, a vaccine vector can include an additional DNA sequence encoding, e.g., a 
chlamydial antigen , or a cytokine, placed under the control of elements required for expression 
in a mammalian cell. 

Alternatively, a composition of the invention can include several vaccine vectors, 
each of them being capable of expressing a polypeptide or derivative of the invention. A 
composition can also contain a vaccine vector capable of expressing an additional Chlamydia 
antigen, or a subunit, fragment, homolog, mutant, or derivative thereof; or a cytokine such as IL- 
2 or IL-12. 

In vaccination methods for treating or preventing infection in a mammal, a vaccine 
vector of the invention can be administered by any conventional route in use in the vaccine field, 
particularly, to a mucosal (e.g., ocular, intranasal, oral, gastric, pulmonary, intestinal, rectal, 
vaginal, or urinary tract) surface or via the parenteral (e.g., subcutaneous, intradermal, 
intramuscular, intravenous, or intraperitoneal) route. Preferred routes depend upon the choice of 
the vaccine vector. The administration can be achieved in a single dose or repeated at intervals. 
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The appropriate dosage depends on various parameters understood by skilled artisans such as the 
vaccine vector itself, the route of administration or the condition of the mammal to be vaccinated 

(weight, age and the like). 

Live vaccine vectors available in the art include viral vectors such as adenoviruses 
and poxviruses as well as bacterial vectors, e.g., Shigella, Salmonella, Vibrio cholerae, 
Lactobacillus, Bacille bilte de Calmette-Gufrin (BCG), and Streptococcus. 

An example of an adenovirus vector, as well as a method for constructing an 
adenovirus vector capable of expressing a DNA molecule of the invention, are described in 
U.S. Patent No. 4,920,209. Poxvirus vectors that can be used include, e.g., vaccinia and canary 
pox virus, described in U.S. Patent No. 4,722,848 and U.S. Patent No. 5,364,773, respectively 
(also see, e.g., Tartaglia et ah, Virology (1992) 188:217) for a description of a vaccinia virus 
vector; and Taylor et al 9 Vaccine (1995) 13:539 for a reference of a canary pox). Poxvirus 
vectors capable of expressing a polynucleotide of the invention can be obtained by homologous 
recombination as described in Kieny et ah, Nature (1984) 312:163 so that the polynucleotide of 
the invention is inserted in the viral genome under appropriate conditions for expression in 
mammalian cells. Generally, the dose of vaccine viral vector, for therapeutic or prophylactic use, 
can be of from about IxlO 4 to about lxlO 11 , advantageously from about 1x1 0 7 to about lxlO 10 , 
preferably of from about lxlO 7 to about lxlO 9 plaque-forming units per kilogram. Preferably, 
viral vectors are administered parenterally; for example, in 3 doses, 4 weeks apart. Those skilled 
in the art recognize that it is preferable to avoid adding a chemical adjuvant to a composition 
containing a viral vector of the invention and thereby minimizing the immune response to the 
viral vector itself. 

Non-toxicogenic Vibrio cholerae mutant strains that are useful as a live oral vaccine 
are described in Mekalanos et ah, Nature (1983) 306:551 and U.S. Patent No. 4,882,278 (strain 
in which a substantial amount of the coding sequence of each of the two ctxA alleles has been 
deleted so that no functional cholerae toxin is produced); WO 92/1 1354 (strain in which the irgA 
locus is inactivated by mutation; this mutation can be combined in a single strain with ctxA 
mutations); and WO 94/1533 (deletion mutant lacking functional ctxA and attRSl DNA 
sequences). These strains can be genetically engineered to express heterologous antigens, as 
described in WO 94/19482. An effective vaccine dose of a Vibrio cholerae strain capable of 
expressing a polypeptide or polypeptide derivative encoded by a DNA molecule of the invention 
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can contain, e.g., about lxlO 5 to about lxlO 9 , preferably about lxlO 6 to about lxlO 8 viable 
bacteria in an appropriate volume for the selected route of administration. Preferred routes of 
administration include all mucosal routes; most preferably, these vectors are administered 
intranasally or orally. 

Attenuated Salmonella typhimurium strains, genetically engineered for recombinant 
expression of heterologous antigens or not, and their use as oral vaccines are described in 
Nakayamaef ah (Bio/Technology (1988) 6:693) and WO 92/11361. Preferred routes of 
administration include all mucosal routes; most preferably, these vectors are administered 
intranasally or orally. 

Others bacterial strains useful as vaccine vectors are described in High et al., EMBO 
(1992) 1 1 :1991 and Sizemore et ah, Science (1995) 270:299 (Shigella Jlexnerf); Medaglini et ah, 
Proc. Natl. Acad. Sci. USA (1995) 92:6868 (Streptococcus gordonii)', and Flynn J.L., Cell. Mol. 
Biol. (1994) 40 (suppl. I):31, WO 88/6626, WO 90/0594, WO 91/13157, WO 92/1796, and WO 
92/21376 (Bacille Calmette Guerin). 

In bacterial vectors, polynucleotide of the invention can be inserted into the bacterial 
genome or can remain in a free state, carried on a plasmid. 

An adjuvant can also be added to a composition containing a vaccine bacterial vector. 
A number of adjuvants are known to those skilled in the art. Preferred adjuvants can be selected 
from the list provided below. 

According to a fourth aspect of the invention, there is also provided (i) a composition 
of matter containing a polynucleotide of the invention, together with a diluent or carrier; (ii) a 
pharmaceutical composition containing a therapeutically or prophylactically effective amount of 
a polynucleotide of the invention; (iii) a method for inducing an immune response against 
Chlamydia, in a mammal, by administering to the mammal, an immunogenically effective 
amount of a polynucleotide of the invention to elicit an immune response, e.g., a protective 
immune response to Chlamydia; and particularly, (iv) a method for preventing and/or treating a 
Chlamydia (e.g., C trachomatis, C psittaci, C pneumoniae, or C. pecorum) infection, by 
administering a prophylactic or therapeutic amount of a polynucleotide of the invention to an 
individual in need. Additionally, the fourth aspect of the invention encompasses the use of a 
polynucleotide of the invention in the preparation of a medicament for preventing and/or treating 
Chlamydia infection. The fourth aspect of the invention preferably includes the use of a DNA 
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molecule placed under conditions for expression in a mammalian cell, e.g., in a plasmid that is 
unable to replicate in mammalian cells and to substantially integrate in a mammalian genome. 

Polynucleotides (DNA or RNA) of the invention can also be administered as such to a 
mammal for vaccine, e.g., therapeutic or prophylactic, purpose. When a DNA molecule of the 
invention is used, it can be in the form of a plasmid that is unable to replicate in a mammalian 
cell and unable to integrate in the mammalian genome. Typically, a DNA molecule is placed 
under the control of a promoter suitable for expression in a mammalian cell. The promoter can 
function ubiquitously or tissue-specifically. Examples of non-tissue specific promoters include 
the early Cytomegalovirus (CMV) promoter (described in U.S. Patent No. 4,168,062) and the 
Rous Sarcoma Virus promoter (described in Norton & Coffin, Molec. Cell Biol. (1985) 5:281). 
The desmin promoter (Li et al. 9 Gene (1989) 78:243, Li & Paulin, J. Biol. Chem. (1991) 
266:6562 and Li & Paulin, J. Biol. Chem. (1993) 268:10403) is tissue-specific and drives 
expression in muscle cells. More generally, useful vectors are described, La., WO 94/21797 and 
Hartikka et aL, Human Gene Therapy (1996) 7:1205. 

For DNA/RNA vaccination, the polynucleotide of the invention can encode a 
precursor or a mature form. When it encodes a precursor form, the precursor form can be 
homologous or heterologous. In the latter case, a eucaryotic leader sequence can be used, such as 
the leader sequence of the tissue-type plasminogen factor (tPA). 

A composition of the invention can contain one or several polynucleotides of the 
invention. It can also contain at least one additional polynucleotide encoding another Chlamydia 
antigen such as urease subunit A, B, or both; or a fragment, derivative, mutant, or analog thereof. 
A polynucleotide encoding a cytokine, such as interleukin-2 (IL-2) or interleukin-12 (IL-12), can 
also be added to the composition so that the immune response is enhanced. These additional 
polynucleotides are placed under appropriate control for expression. Advantageously, DNA 
molecules of the invention and/or additional DNA molecules to be included in the same 
composition, can be carried in the same plasmid. 

Standard techniques of molecular biology for preparing and purifying polynucleotides 
can be used in the preparation of polynucleotide therapeutics of the invention. For use as a 
vaccine, a polynucleotide of the invention can be formulated according to various methods. 

First, a polynucleotide can be used in a naked form, free of any delivery vehicles, 
such as anionic liposomes, cationic lipids, microparticles, e.g. 9 gold microparticles, precipitating 
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agents, e.g. y calcium phosphate, or any other transfection-facilitating agent. In this case, the 
polynucleotide can be simply diluted in a physiologically acceptable solution, such as sterile 
saline or sterile buffered saline, with or without a carrier. When present, the carrier preferably is 
isotonic, hypotonic, or weakly hypertonic, and has a relatively low ionic strength, such as 
provided by a sucrose solution, e.g., a solution containing 20% sucrose. 

Alternatively, a polynucleotide can be associated with agents that assist in cellular 
uptake. It can be, La., (i) complemented with a chemical agent that modifies the cellular 
permeability, such as bupivacaine (see, e.g., WO 94/16737), (ii) encapsulated into liposomes, or 
(iii) associated with cationic lipids or silica, gold, or tungsten microparticles. 

Anionic and neutral liposomes are well-known in the art (see, e.g., Liposomes: A 
Practical Approach, RPC New Ed, IRL press (1990), for a detailed description of methods for 
making liposomes) and are useful for delivering a large range of products, including 
polynucleotides. 

Cationic lipids are also known in the art and are commonly vised for gene delivery. 
Such lipids include Lipofectin™ also known as DOTMA (N-[l-(2,3-dioleyloxy)propyl]-N,N,N- 
trimethy lammonium chloride), DOTAP ( 1 ,2-bis(oleyloxy)-3 -(trimethylammonio)propane), 
DDAB (dimethyldioctadecy lammonium bromide), DOGS (dioctadecylamidologlycyl spermine) 
and cholesterol derivatives such as DC-Choi (3 beta-(N-(N r ,N'-dimethyl aminomethane)- 
carbamoyl) cholesterol). A description of these cationic lipids can be found in EP 187,702, 
WO 90/1 1092, U.S. Patent No. 5,283,185, WO 91/15501, WO 95/26356, and U.S. Patent 
No. 5,527,928. Cationic lipids for gene delivery are preferably used in association with a neutral 
lipid such as DOPE (dioleyl phosphatidylethanolamine), as, for example, described in 
WO 90/11092. 

Other transfection-facilitating compounds can be added to a formulation containing 
cationic liposomes. A number of them are described in, e.g., WO 93/18759, WO 93/19768, WO 
94/25608, and WO 95/2397. They include, i.a. 9 spermine derivatives useful for facilitating the 
transport of DNA through the nuclear membrane (see, for example, WO 93/18759) and 
membrane-permeabilizing compounds such as GALA, Gramicidine S, and cationic bile salts 
(see, for example, WO 93/19768). 

Gold or tungsten microparticles can also be used for gene delivery, as described in 
WO 91/359, WO 93/17706, and Tangef al (Nature (1992) 356:152). In this case, the 
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microparticle-coated polynucleotides can be injected via intradermal or intraepidermal routes 
using a needleless injection device ("gene gun"), such as those described in U.S. Patent 
No. 4,945,050, U.S. Patent No. 5,015,580, and WO 94/24263. 

The amount of DNA to be used in a vaccine recipient depends, e.g., on the strength of 
the promoter used in the DNA construct, the immunogenicity of the expressed gene product, the 
condition of the mammal intended for administration (e.g., the weight, age, and general health of 
the mammal), the mode of administration, and the type of formulation. In general, a 
therapeutically or prophylactically effective dose from about 1 \ig to about 1 mg, preferably, 
from about 10 ug to about 800 ug and, more preferably, from about 25 ug to about 250 ^g, can 
be administered to human adults. The administration can be achieved in a single dose or 
repeated at intervals. 

The route of administration can be any conventional route used in the vaccine field. 
As general guidance, a polynucleotide of the invention can be administered via a mucosal 
surface, e.g., an ocular, intranasal, pulmonary, oral, intestinal, rectal, vaginal, and urinary tract 
surface; or via a parenteral route, e.g., by an intravenous, subcutaneous, intraperitoneal, 
intradermal, intraepidermal, or intramuscular route. The choice of the administration route will 
depend on, e.g., the formulation that is selected. A polynucleotide formulated in association with 
bupivacaine is advantageously administered into muscles. When a neutral or anionic liposome or 
a cationic lipid, such as DOTMA or DC-Choi, is used, the formulation can be advantageously 
injected via intravenous, intranasal (aerosolization), intramuscular, intradermal, and 
subcutaneous routes. A polynucleotide in a naked form can advantageously be administered via 
the intramuscular, intradermal, or sub-cutaneous routes. 

Although not absolutely required, such a composition can also contain an adjuvant. If 
so, a systemic adjuvant that does not require concomitant administration in order to exhibit an 
adjuvant effect is preferable such as, e.g., QS21, which is described in U.S. Patent No. 5,057,546. 

The sequence information provided in the present application enables the design of 
specific nucleotide probes and primers that can be useful in diagnosis. Accordingly, in a fifth 
aspect of the invention, there is provided a nucleotide probe or primer having a sequence found 
in or derived by degeneracy of the genetic code from a sequence shown in SEQ ID NO:l to 2. 

The term "probe" as used in the present application refers to DNA (preferably single 
stranded) or RNA molecules (or modifications or combinations thereof) that hybridize under the 
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stringent conditions, as defined above, to nucleic acid molecules having sequences homologous 
to those shown in SEQ ID NOs:l and 2, or to a complementary or anti-sense sequence. 
Generally, probes are significantly shorter than full-length sequences shown in SEQ ID NOs:l 
and 2; for example, they can contain from about 5 to about 100, preferably from about 10 to 
about 80 nucleotides. In particular, probes have sequences that are at least 75%, preferably at 
least 85%, more preferably 95% homologous to a portion of a sequence as shown in SEQ ID 
NOs:l and 2 or that are complementary to such sequences. Probes can contain modified bases 
such as inosine, methyl-5-deoxycytidine, deoxyuridine, dimethylamino-5-deoxyuridine, or 
diamino-2, 6-purine. Sugar or phosphate residues can also be modified or substituted. For 
example, a deoxyribose residue can be replaced by a polyamide (Nielsen et al. 9 Science (1991) 
254:1497) and phosphate residues can be replaced by ester groups such as diphosphate, alkyl, 
arylphosphonate and phosphorothioate esters. In addition, the 2'-hydroxyl group on 
ribonucleotides can be modified by including, e.g., alkyl groups. 

Probes of the invention can be used in diagnostic tests, as capture or detection probes. 
Such capture probes can be conventionally immobilized on a solid support, directly or indirectly, 
by covalent means or by passive adsorption. A detection probe can be labelled by a detection 
marker selected from radioactive isotopes; enzymes such as peroxidase, alkaline phosphatase, 
and enzymes able to hydrolyze a chromogenic, fluorogenic, or luminescent substrate; compounds 
that are chromogenic, fluorogenic, or luminescent; nucleotide base analogs; and biotin. 

Probes of the invention can be used in any conventional hybridization technique, such 
as dot blot (Maniatis et aL, Molecular Cloning: A Laboratory Manual (1982) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York), Southern blot (Southern, J. Mol. Biol. (1975) 
98:503), northern blot (identical to Southern blot to the exception that RNA is used as a target), 
or the sandwich technique (Dunn et ah, Cell (1977) 12:23). The latter technique involves the use 
of a specific capture probe and/or a specific detection probe with nucleotide sequences that at 
least partially differ from each other. 

A primer is usually a probe of about 10 to about 40 nucleotides that is used to initiate 
enzymatic polymerization of DNA in an amplification process (e.g., PCR), in an elongation 
process, or in a reverse transcription method. In a diagnostic method involving PCR, primers 
can be labelled. 
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Thus, the invention also encompasses (i) a reagent containing a probe of the invention 
for detecting and/or identifying the presence of Chlamydia in a biological material; (ii) a method 
for detecting and/or identifying the presence of Chlamydia in a biological material, in which (a) a 
sample is recovered or derived from the biological material, (b) DNA or RNA is extracted from 
the material and denatured, and (c) exposed to a probe of the invention, for example, a capture, 
detection probe or both, under stringent hybridization conditions, such that hybridization is 
detected; and (iii) a method for detecting and/or identifying the presence of Chlamydia in a 
biological material, in which (a) a sample is recovered or derived from the biological material, 
(b) DNA is extracted therefrom, (c) the extracted DNA is primed with at least one, and preferably 
two, primers of the invention and amplified by polymerase chain reaction, and (d) the amplified 
DNA fragment is produced. 

As previously mentioned, polypeptides that can be produced upon expression of the 
newly identified open reading frames are useful vaccine agents. 

Therefore, a sixth aspect of the invention features a substantially purified polypeptide 
or polypeptide derivative having an amino acid sequence encoded by a polynucleotide of the 
invention. 

A "substantially purified polypeptide" is defined as a polypeptide that is separated 
from the environment in which it naturally occurs and/or that is free of the majority of the 
polypeptides that are present in the environment in which it was synthesized. For example, a 
substantially purified polypeptide is free from cytoplasmic polypeptides. Those skilled in the art 
will understand that the polypeptides of the invention can be purified from a natural source, i.e. t a 
Chlamydia strain, or can be produced by recombinant means. 

Homologous polypeptides or polypeptide derivatives encoded by polynucleotides of 
the invention can be screened for specific antigenicity by testing cross-reactivity with an 
antiserum raised against the polypeptide of reference having an amino acid sequence as shown in 
SEQ ID NOs:3 . Briefly, a monospecific hyperimmune antiserum can be raised against a 
purified reference polypeptide as such or as a fusion polypeptide, for example, an expression 
product of MBP, GST, or His-tag systems or a synthetic peptide predicted to be antigenic. The 
homologous polypeptide or derivative screened for specific antigenicity can be produced as such 
or as a fusion polypeptide. In this latter case and if the antiserum is also raised against a fusion 
polypeptide, two different fusion systems are employed. Specific antigenicity can be determined 
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according to a number of methods, including Western blot (Towbin et aL, Proc. Natl. Acad. Sci. 
USA (1979) 76:4350), dot blot, and ELISA, as described below. 

In a Western blot assay, the product to be screened, either as a purified preparation or 
a total E. coli extract, is submitted to SDS-Page electrophoresis as described by Laemmli (Nature 
(1970) 227:680). After transfer to a nitrocellulose membrane, the material is further incubated 
with the monospecific hyperimmune antiserum diluted in the range of dilutions from about 1 :5 to 
about 1 :5000, preferably from about 1 : 100 to about 1:500. Specific antigenicity is shown once a 
band corresponding to the product exhibits reactivity at any of the dilutions in the above range. 

In an ELISA assay, the product to be screened is preferably used as the coating 
antigen. A purified preparation is preferred, although a whole cell extract can also be used. 
Briefly, about 100 |il of a preparation at about 10 \ig protein/ml are distributed into wells of a 96- 
well polycarbonate ELISA plate. The plate is incubated for 2 hours at 37°C then overnight at 
4°C. The plate is washed with phosphate buffer saline (PBS) containing 0.05% Tween 20 
(PBS/Tween buffer). The wells are saturated with 250 (il PBS containing 1% bovine serum 
albumin (BSA) to prevent non-specific antibody binding. After 1 hour incubation at 37°C, the 
plate is washed with PBS/Tween buffer. The antiserum is serially diluted in PBS/Tween buffer 
containing 0.5% BSA. 100 \xl of dilutions are added per well. The plate is incubated for 
90 minutes at 37°C, washed and evaluated according to standard procedures. For example, a 
goat anti-rabbit peroxidase conjugate is added to the wells when specific antibodies were raised 
in rabbits. Incubation is carried out for 90 minutes at 37°C and the plate is washed. The reaction 
is developed with the appropriate substrate and the reaction is measured by colorimetry 
(absorbance measured spectrophotometrically). Under the above experimental conditions, a 
positive reaction is shown by O.D. values greater than a non immune control serum. 

In a dot blot assay, a purified product is preferred, although a whole cell extract can 
also be used. Briefly, a solution of the product at about 100 |ig/ml is serially two-fold diluted in 
50 raM Tris-HCl (pH 7.5). 100 \xl of each dilution are applied to a nitrocellulose membrane 0.45 
|im set in a 96-well dot blot apparatus (Biorad). The buffer is removed by applying vacuum to 
the system. Wells are washed by addition of 50 mM Tris-HCl (pH 7.5) and the membrane is air- 
dried. The membrane is saturated in blocking buffer (50 mM Tris-HCl (pH 7.5) 0.15 M NaCl, 
10 g/L skim milk) and incubated with an antiserum dilution from about 1:50 to about 1:5000, 
preferably about 1:500. The reaction is revealed according to standard procedures. For example, 
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a goat anti-rabbit peroxidase conjugate is added to the wells when rabbit antibodies are used. 
Incubation is carried out 90 minutes at 37°C and the blot is washed. The reaction is developed 
with the appropriate substrate and stopped. The reaction is measured visually by the appearance 
of a colored spot, e.g., by colorimetry. Under the above experimental conditions, a positive 
5 reaction is shown once a colored spot is associated with a dilution of at least about 1:5, 
preferably of at least about 1 :500. 

Therapeutic or prophylactic efficacy of a polypeptide or derivative of the invention 
can be evaluated as described below. 

According to a seventh aspect of the invention, there is provided (i) a composition of 
10 matter containing a polypeptide of the invention together with a diluent or carrier; in particular, 
(ii) a pharmaceutical composition containing a therapeutically or prophylactically effective 
amount of a polypeptide of the invention; (iii) a method for inducing an immune response against 

01 Chlamydia in a mammal, by administering to the mammal an immunogenically effective amount 

2 of a polypeptide of the invention to elicit an immune response, e.g., a protective immune 
15 response to Chlamydial and particularly, (iv) a method for preventing and/or treating a 

UJ Chlamydia (e.g., C. trachomatis. C. psittaci, C pneumoniae, or C pecorum) infection, by 
2 administering a prophylactic or therapeutic amount of a polypeptide of the invention to an 
individual in need. Additionally, the seventh aspect of the invention encompasses the use of a 
0^ polypeptide of the invention in the preparation of a medicament for preventing and/or treating 
H 20 Chlamydia infection. 

J2 The immunogenic compositions of the invention can be administered by any 

conventional route in use in the vaccine field, in particular to a mucosal (e.g., ocular, intranasal, 
pulmonary, oral, gastric, intestinal, rectal, vaginal, or urinary tract) surface or via the parenteral 
(e.g., subcutaneous, intradermal, intramuscular, intravenous, or intraperitoneal) route. The 

25 choice of the administration route depends upon a number of parameters, such as the adjuvant 
associated with the polypeptide. For example, if a mucosal adjuvant is used, the intranasal or 
oral route will be preferred and if a lipid formulation or an aluminum compound is used, the 
parenteral route will be preferred. In the latter case, the sub-cutaneous or intramuscular route is 
most preferred. The choice can also depend upon the nature of the vaccine agent. For example, 

30 a polypeptide of the invention fused to CTB or LTB will be best administered to a mucosal 
surface. 
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A composition of the invention can contain one or several polypeptides or derivatives 
of the invention. It can also contain at least one additional Chlamydia antigen, or a subunit, 
fragment, homolog, mutant, or derivative thereof 

For use in a composition of the invention, a polypeptide or derivative thereof can be 
formulated into or with liposomes, preferably neutral or anionic liposomes, microspheres, 
ISCOMS, or virus-like-particles (VLPs) to facilitate delivery and/or enhance the immune 
response. These compounds are readily available to one skilled in the art; for example, see 
Liposomes: A Practical Approach {supra). 

Adjuvants other than liposomes and the like can also be used and are known in the 
art. A appropriate selection can conventionally be made by those skilled in the art, for example, 
from the list provided below. 

Administration can be achieved in a single dose or repeated as necessary at intervals 
as can be determined by one skilled in the art. For example, a priming dose can be followed by 
three booster doses at weekly or monthly intervals. An appropriate dose depends on various 
parameters including the recipient {e.g., adult or infant), the particular vaccine antigen, the route 
and frequency of administration, the presence/absence or type of adjuvant, and the desired effect 
{e.g., protection and/or treatment), as can be determined by one skilled in the art. In general, a 
vaccine antigen of the invention can be administered by a mucosal route in an amount from about 
10 ng to about 500 mg, preferably from about 1 mg to about 200 mg. For the parenteral route of 
administration, the dose usually should not exceed about 1 mg, preferably about 100 ug. 

When used as vaccine agents, polynucleotides and polypeptides of the invention can 
be used sequentially as part of a multistep immunization process. For example, a mammal can 
be initially primed with a vaccine vector of the invention such as a pox virus, e.g., via the 
parenteral route, and then boosted twice with the polypeptide encoded by the vaccine vector, e.g., 
via the mucosal route. In another example, liposomes associated with a polypeptide or derivative 
of the invention can also be used for priming, with boosting being carried out mucosally using a 
soluble polypeptide or derivative of the invention in combination with a mucosal adjuvant {e.g., 
LT). 

A polypeptide derivative of the invention is also useful as a diagnostic reagent for 
detecting the presence of vxtfx-Chlamydia antibodies, e.g., in a blood sample. Such polypeptides 
are about 5 to about 80, preferably about 10 to about 50 amino acids in length and can be labeled 
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or unlabeled, depending upon the diagnostic method. Diagnostic methods involving such a 

reagent are described below. 

Upon expression of a DNA molecule of the invention, a polypeptide or polypeptide 
derivative is produced and can be purified using known laboratory techniques. For example, the 
5 polypeptide or polypeptide derivative can be produced as a fusion protein containing a fused tail 
that facilitates purification. The fusion product can be used to immunize a small mammal, e.g., a 
mouse or a rabbit, in order to raise antibodies against the polypeptide or polypeptide derivative 
(monospecific antibodies). The eighth aspect of the invention thus provides a monospecific 
antibody that binds to a polypeptide or polypeptide derivative of the invention. 
10 By "monospecific antibody" is meant an antibody that is capable of reacting with a 

unique naturally-occurring Chlamydia polypeptide. An antibody of the invention can be 
polyclonal or monoclonal. Monospecific antibodies can be recombinant, e.g., chimeric {e.g., 
d constituted by a variable region of murine origin associated with a human constant region), 
2 humanized (a human immunoglobulin constant backbone together with hypervariable region of 
b 15 animal, e.g., murine, origin), and/or single chain. Both polyclonal and monospecific antibodies 
J) can also be in the form of immunoglobulin fragments, e.g., F(ab)'2 or Fab fragments. The 
antibodies of the invention can be of any isotype, e.g., IgG or IgA, and polyclonal antibodies can 
be of a single isotype or can contain a mixture of isotypes. 

The antibodies of the invention, which are raised to a polypeptide or polypeptide 
20 derivative of the invention, can be produced and identified using standard immunological assays, 
e.g., Western blot analysis, dot blot assay, or ELISA (see, e.g., Coligan et al, Current Protocols 
in Immunology (1994) John Wiley & Sons, Inc., New York, NY). The antibodies can be used in 
diagnostic methods to detect the presence of a Chlamydia antigen in a sample, such as a 
biological sample. The antibodies can also be used in affinity chromatography methods for 
25 purifying a polypeptide or polypeptide derivative of the invention. As is discussed further 
below, such antibodies can be used in prophylactic and therapeutic passive immunization 
methods. 

Accordingly, a ninth aspect of the invention provides (i) a reagent for detecting the 
presence of Chlamydia in a biological sample that contains an antibody, polypeptide, or 
30 polypeptide derivative of the invention; and (ii) a diagnostic method for detecting the presence of 
Chlamydia in a biological sample, by contacting the biological sample with an antibody, a 



%0 



25 



RY51 111998 

polypeptide, or a polypeptide derivative of the invention, such that an immune complex is 
formed, and by detecting such complex to indicate the presence of Chlamydia in the sample or 
the organism from which the sample is derived. 

Those skilled in the art will understand that the immune complex is formed between a 
component of the sample and the antibody, polypeptide, or polypeptide derivative, whichever is 
used, and that any unbound material can be removed prior to detecting the complex. As can be 
easily understood, a polypeptide reagent is useful for detecting the presence of anti-Chlamydia 
antibodies in a sample, e.g., a blood sample, while an antibody of the invention can be used for 
screening a sample, such as a gastric extract or biopsy, for the presence of Chlamydia 
polypeptides. 

For use in diagnostic applications, the reagent (z.e., the antibody, polypeptide, or 
polypeptide derivative of the invention) can be in a free state or immobilized on a solid support, 
such as a tube, a bead, or any other conventional support used in the field. Immobilization can 
be achieved using direct or indirect means. Direct means include passive adsorption (non- 
covalent binding) or covalent binding between the support and the reagent. By "indirect means" 
is meant that an anti-reagent compound that interacts with a reagent is first attached to the solid 
support. For example, if a polypeptide reagent is used, an antibody that binds to it can serve as 
an anti-reagent, provided that it binds to an epitope that is not involved in the recognition of 
antibodies in biological samples. Indirect means can also employ a ligand-receptor system, for 
example, a molecule such as a vitamin can be grafted onto the polypeptide reagent and the 
corresponding receptor can be immobilized on the solid phase. This is illustrated by the biotin- 
streptavidin system. Alternatively, indirect means can be used, e.g., by adding to the reagent a 
peptide tail, chemically or by genetic engineering, and immobilizing the grafted or fused product 
by passive adsorption or covalent linkage of the peptide tail. 

According to a tenth aspect of the invention, there is provided a process for purifying, 
from a biological sample, a polypeptide or polypeptide derivative of the invention, which 
involves carrying out antibody-based affinity chromatography with the biological sample, 
wherein the antibody is a monospecific antibody of the invention. 

For use in a purification process of the invention, the antibody can be polyclonal or 
monospecific, and preferably is of the IgG type. Purified IgGs can be prepared from an 
antiserum using standard methods (see, e.g., Coligan et aL 9 supra). Conventional 
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chromatography supports, as well as standard methods for grafting antibodies, are disclosed in, 
e.g., Antibodies: A Laboratory Manual, D. Lane, E. Harlow, Eds. (1988). 

Briefly, a biological sample, such as an C pneumoniae extract, preferably in a buffer 
solution, is applied to a chromatography material, preferably equilibrated with the buffer used to 
dilute the biological sample so that the polypeptide or polypeptide derivative of the invention 
{Le., the antigen) is allowed to adsorb onto the material. The chromatography material, such as a 
gel or a resin coupled to an antibody of the invention, can be in batch form or in a column. The 
unbound components are washed off and the antigen is then eluted with an appropriate elution 
buffer, such as a glycine buffer or a buffer containing a chaotropic agent, e.g., guanidine HC1, or 
high salt concentration {e.g., 3 M MgCIJ. Eluted fractions are recovered and the presence of the 
antigen is detected, e.g., by measuring the absorbance at 280 nm. 

An antibody of the invention can be screened for therapeutic efficacy as described as 
follows. According to an eleventh aspect of the invention, there is provided (i) a composition of 
matter containing a monospecific antibody of the invention, together with a diluent or carrier; (ii) 
a pharmaceutical composition containing a therapeutically or prophylactically effective amount 
of a monospecific antibody of the invention, and (iii) a method for treating or preventing a 
Chlamydia {e.g., C. trachomatis, C. psittaci, C. pneumoniae or C. pecorum) infection, by 
administering a therapeutic or prophylactic amount of a monospecific antibody of the invention 
to an individual in need. Additionally, the eleventh aspect of the invention encompasses the use 
of a monospecific antibody of the invention in the preparation of a medicament for treating or 
preventing Chlamydia infection. 

To this end, the monospecific antibody can be polyclonal or monoclonal, preferably 
of the IgA isotype (predominantly). In passive immunization, the antibody can be administered 
to a mucosal surface of a mammal, e.g., the gastric mucosa, e.g., orally or intragastrically, 
advantageously, in the presence of a bicarbonate buffer. Alternatively, systemic administration, 
not requiring a bicarbonate buffer, can be carried out. A monospecific antibody of the invention 
can be administered as a single active component or as a mixture with at least one monospecific 
antibody specific for a different Chlamydia polypeptide. The amount of antibody and the 
particular regimen used can be readily determined by one skilled in the art. For example, daily 
administration of about 100 to 1,000 mg of antibodies over one week, or three doses per day of 
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about 100 to 1,000 mg of antibodies over two or three days, can be an effective regimens for 
most purposes. 

Therapeutic or prophylactic efficacy can be evaluated using standard methods in the 
art, e.g., by measuring induction of a mucosal immune response or induction of protective and/or 
therapeutic immunity, using, e.g., the C. pneumoniae mouse model . Those skilled in the art will 
recognize that the C pneumoniae strain of the model can be replaced with another Chlamydia 
strain. For example, the efficacy of DN A molecules and polypeptides from C. pneumoniae is 
preferably evaluated in a mouse model using an C. pneumoniae strain. Protection can be 
determined by comparing the degree of Chlamydia infection to that of a control group. 
Protection is shown when infection is reduced by comparison to the control group. Such an 
evaluation can be made for polynucleotides, vaccine vectors, polypeptides and derivatives 
thereof, as well as antibodies of the invention. 

Adjuvants useful in any of the vaccine compositions described above are as follows. 

Adjuvants for parenteral administration include aluminum compounds, such as 
aluminum hydroxide, aluminum phosphate, and aluminum hydroxy phosphate. The antigen can 
be precipitated with, or adsorbed onto, the aluminum compound according to standard protocols. 
Other adjuvants, such as RIBI (ImmunoChem, Hamilton, MT), can be used in parenteral 
administration. 

Adjuvants for mucosal administration include bacterial toxins, e.g., the cholera toxin 
(CT), the E. coli heat-labile toxin (LT), the Clostridium difficile toxin A and the pertussis toxin 
(PT), or combinations, subunits, toxoids, or mutants thereof. For example, a purified preparation 
of native cholera toxin subunit B (CTB) can be of use. Fragments, homologs, derivatives, and 
fusions to any of these toxins are also suitable, provided that they retain adjuvant activity. 
Preferably, a mutant having reduced toxicity is used. Suitable mutants are described, e.g., in WO 
95/17211 (Arg-7-Lys CT mutant), WO 96/6627 (Arg-192-Gly LT mutant), and WO 95/34323 
(Arg-9-Lys and Glu-129-Gly PT mutant). Additional LT mutants that can be used in the 
methods and compositions of the invention include, e.g., Ser-63-Lys, Ala-69-Gly, Glu-1 10- Asp, 
and Glu-1 12-Asp mutants. Other adjuvants, such as a bacterial monophosphoryl lipid A 
(MPLA) of, e.g., E, coli, Salmonella minnesota, Salmonella typhimurium, or Shigella Jlexneri; 
saponins, or polylactide glycolide (PLGA) microspheres, can also be used in mucosal 
administration. 
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Adjuvants useful for both mucosal and parenteral administrations include 
polyphosphazene (WO 95/2415), DC-chol (3 b-(N-(N',N'-dimethyl aminomethane)-carbamoyl) 
cholesterol; U.S. Patent No. 5,283,185 and WO 96/14831) and QS-21 (WO 88/9336). 

Any pharmaceutical composition of the invention, containing a polynucleotide, a 
polypeptide, a polypeptide derivative, or an antibody of the invention, can be manufactured in a 
conventional manner. In particular, it can be formulated with a pharmaceutically acceptable 
diluent or carrier, e.g., water or a saline solution such as phosphate buffer saline. In general, a 
diluent or carrier can be selected on the basis of the mode and route of administration, and 
standard pharmaceutical practice. Suitable pharmaceutical carriers or diluents, as well as 
pharmaceutical necessities for their use in pharmaceutical formulations, are described in 
Remington's Pharmaceutical Sciences, a standard reference text in this field and in the USP/NF. 

The invention also includes methods in which Chlamydia infection, are treated by 
oral administration of a Chlamydia polypeptide of the invention and a mucosal adjuvant, in 
combination with an antibiotic, an antacid, sucralfate, or a combination thereof. Examples of 
such compounds that can be administered with the vaccine antigen and the adjuvant are 
antibiotics, including, e.g., macrolides, tetracyclines, and derivatives thereof (specific examples 
of antibiotics that can be used include azithromycin or doxicyclin or immunomodulators such as 
cytokines or steroids. In addition, compounds containing more than one of the above-listed 
components coupled together, can be used. The invention also includes compositions for 
carrying out these methods, i.e., compositions containing a Chlamydia antigen (or antigens) of 
the invention, an adjuvant, and one or more of the above-listed compounds, in a 
pharmaceutically acceptable carrier or diluent. 

Amounts of the above-listed compounds used in the methods and compositions of the 
invention can readily be determined by one skilled in the art. In addition, one skilled in the art 
can readily design treatment/immunization schedules. For example, the non-vaccine components 
can be administered on days 1-14, and the vaccine antigen + adjuvant can be administered on 
days 7, 14, 21, and 28. 
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get age agt ttt att age age aac aaa gca att age ttt ata aac aat 835 
Ala Ser Ser Phe lie Ser Ser Asn Lys Ala lie Ser Phe lie Asn Asn 
230 235 240 245 

agt gtg acc gca ace tea get aca ggg gga gec att tac tgt agt agt 8 83 
Ser Val Thr Ala Thr Ser Ala Thr Gly Gly Ala lie Tyr Cys Ser Ser 
250 255 260 

aca tea gec ccc aaa cca gtc tta act eta tea gac aac ggg gaa ctg 931 
Thr Ser Ala Pro Lys Pro Val Leu Thr Leu Ser Asp Asn Gly Glu Leu 
265 270 275 

aac ttt ata gga aat aca gca att act agt ggt ggg gcg att tat act 97 9 
Asn Phe lie Gly Asn Thr Ala He Thr Ser Gly Gly Ala He Tyr Thr 
280 285 290 



gac aat eta gtt ctt tct tct gga gga cct acg ctt ttt aaa aac aac 
Asp Asn Leu Val Leu Ser Ser Gly Gly Pro Thr Leu Phe Lys Asn Asn 
295 300 305 



aga aat tct att aac ate gga aac acc aat get aag att gta cag ctg 
Arg Asn Ser He Asn He Gly Asn Thr Asn Ala Lys He Val Gin Leu 
360 365 370 



gga gtc act eta gtt get aag tec ttt teg caa tct ccg ggc tct acc 
Gly Val Thr Leu Val Ala Lys Ser Phe Ser Gin Ser Pro Gly Ser Thr 
455 460 465 
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tct ggc tat gat act gca get ccc tta gga gga gca att gcg att get 1075 
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gac tct gga tct ttg agt ctt teg get ctt ggt gga gac ate act ttt 1123 
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Glu Gly Asn Thr Val Val Lys Gly Ala Ser Ser Ser Gin Thr Thr Thr 
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cga gec tct caa ggc aat act ate tac ttc tat gat cct ata aca act 1267 
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att cag caa cct eta act ctt gcg gga ggg caa etc tct ctt aaa tea 14 59 
He Gin Gin Pro Leu Thr Leu Ala Gly Gly Gin Leu Ser Leu Lys Ser 
440 445 450 
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Asn 


att 
lie 


cac 
His 


ate 
He 
555 


aca 
Thr 


gac 
Asp 


tta 
Leu 


get 
Ala 


get 
Ala 
560 


gat 
Asp 


cce 
Pro 


eta 
Leu 


gaa 
Glu 


aaa 
Lys 
565 


1795 




aat 
Asn 


cct 
Pro 


ate 
lie 


cat 
His 


tgg 
Trp 
570 


gga 
Gly 


tac 
Tyr 


caa 
Gin 


ggg 

Gly 


aat 
Asn 
575 


tgg 
Trp 


gca 
Ala 


tta 
Leu 


tct 
Ser 


tgg 
Trp 
580 


caa 
Gin 


1843 


fir? 


gag 
Glu 


gat 
Asp 


act 
Thr 


gcg 
Ala 
585 


act 
Thr 


aaa 
Lys 


tec 
Ser 


aaa 
Lys 


gca 
Ala 
590 


gcg 
Ala 


act 
Thr 


ctt 
Leu 


acc 
Thr 


tgg 
Trp 
595 


aca 
Thr 


aaa 
Lys 


1891 


P 

s _ i 

S -S 


aca 
Thr 


gga 
Gly 


tac 
Tyr 
600 


aat 
Asn 


ccg 
Pro 


aat 
Asn 


cct 
Pro 


gag 
Glu 
605 


cgt 
Arg 


cgt 
Arg 


gga 
Gly 


acc 
Thr 


tta 
Leu 
610 


gtt 
Val 


get 
Ala 


aac 
Asn 


1939 




acg 
Thr 


eta 
Leu 
615 


tgg 
Trp 


gga 
Gly 


tec 
Ser 


ttt 
Phe 


gtt 
Val 
620 


gat 
Asp 


gtg 
Val 


cgc 
Arg 


tec 
Ser 


ata 
He 
625 


caa 
Gin 


cag 
Gin 


ctt 
Leu 


gta 
Val 


1987 




gee 
Ala 
630 


act 
Thr 


aaa 
Lys 


gta 
Val 


cgc 
Arg 


caa 
Gin 
635 


tct 

Ser 


caa 
Gin 


gaa 
Glu 


act 
Thr 


cgc 
Arg 
640 


ggc 
Gly 


ate 
He 


tgg 
Trp 


tgt 
Cys 


gaa 
Glu 
645 


2035 




ggg 

Gly 


ate 
lie 


teg 
Ser 


aac 
Asn 


ttc 
Phe 
650 


ttc 
Phe 


cat 
His 


aaa 
Lys 


gat 
Asp 


age 
Ser 
655 


acg 
Thr 


aag 
Lys 


ata 
He 


aat 
Asn 


aaa 
Lys 
660 


ggt 
Gly 


2083 




ttt 

Phe 


cgc 
Arg 


cac 
His 


ata 
lie 
665 


agt 
Ser 


gca 
Ala 


ggt 
Gly 


tat 
Tyr 


gtt 
Val 
670 


gta 
Val 


gga gcg 
Gly Ala 


act 
Thr 


aca 
Thr 
675 


aca 
Thr 


tta 
Leu 


2131 




get 
Ala 


tct 
Ser 


gat 
Asp 
680 


aat 
Asn 


ctt 
Leu 


ate 
He 


act 
Thr 


gca 
Ala 
685 


gee 
Ala 


ttc 
Phe 


tgc 
Cys 


caa 
Gin 


tta 
Leu 
690 


ttc 
Phe 


ggg 

Gly 


aaa 
Lys 


2179 




gat 
Asp 


aga 
Arg 
695 


gat 
Asp 


cac 
His 


ttt 
Phe 


ata 
He 


aat 
Asn 
700 


aaa 
Lys 


aat 
Asn 


aga 
Arg 


get 
Ala 


tct 
Ser 
705 


gee 
Ala 


tat 
Tyr 


gca 
Ala 


get 
Ala 


2227 




tct 
Ser 
710 


etc 
Leu 


cat 
His 


etc 
Leu 


cag 
Gin 


cat 
His 
715 


eta 
Leu 


gcg 
Ala 


acc 
Thr 


ttg 
Leu 


tct 
Ser 
720 


tct 
Ser 


cca 
Pro 


age 
Ser 


ttg 
Leu 


tta 
Leu 
725 


2275 




cgc 


tac 


ctt 


cct 


gga 


tct 


gaa 


agt 


gag 


cag 


cct 


gtc 


etc 


ttt 


gat 


get 


2323 



Arg Tyr Leu Pro Gly Ser Glu Ser Glu Gin Pro Val Leu Phe Asp Ala 
730 735 740 
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cag 
Gin 


ate 
He 


age 
Ser 


tat 
Tyr 
745 


ate 
He 


tat 
Tyr 


agt 
Ser 


aaa 
Lys 


aat 
Asn 
750 


act 
Thr 


atg 
Met 


aaa 
Lys 


acc 
Thr 


tat 
Tyr 
755 


tac 
Tyr 


acc 
Thr 


2371 




caa 
Gin 


gca 
Ala 


cca 
Pro 
760 


aag 
Lys 


gga 
Gly 


gag 
Glu 


age 
Ser 


teg 
Ser 
765 


tgg 
Trp 


tat 
Tyr 


aat 
Asn 


gac 
Asp 


ggt 
Gly 
770 


tgc 
Cys 


get 
Ala 


ctg 
Leu 


2419 




gaa 
Glu 


ctt 
Leu 
775 


gcg 
Ala 


age 
Ser 


tec 
Ser 


eta 
Leu 


cca 
Pro 
780 


cac 
His 


act 
Thr 


get 
Ala 


tta 
Leu 


age 
Ser 
785 


cat 
His 


gag 
Glu 


ggt 
Gly 


etc 
Leu 


2467 




ttc 
Phe 
790 


cac 
His 


gcg 
Ala 


tat 


ttt 
Phe 


cct 
Pro 
795 


ttc 
Phe 


ate 
He 


aaa 
Lys 


gta 
Val 


gaa 
Glu 
800 


get 
Ala 


teg 
Ser 


tac 
Tyr 


ata 
He 


cac 
His 
805 


2515 




caa 
Gin 


gat 

Asp 


age 
Ser 


ttc 
Phe 


aaa 
Lys 
810 


gaa 
Glu 


cgt 
Arg 


aat 
Asn 


act 
Thr 


acc 
Thr 
815 


ttg 
Leu 


gta 
Val 


cga 
Arg 


tct 
Ser 


ttc 
Phe 
820 


gat 
Asp 


2563 




age 
Ser 


ggt 
Gly 


gat 
Asp 


tta 
Leu 
825 


att 
He 


aac 
Asn 


gtc 
Val 


tct 
Ser 


gtg 
Val 
830 


cct 
Pro 


att 
He 


gga 
Gly 


att 
He 


acc 
Thr 
835 


ttc 
Phe 


gag 
Glu 


2611 


%r ! 


aga 
Arg 


ttc 
Phe 


teg 
Ser 
840 


aga 
Arg 


aac 
Asn 


gag 
Glu 


cgt 
Arg 


gcg 
Ala 
845 


tct 
Ser 


tac 
Tyr 


gaa 
Glu 


get 
Ala 


act 
Thr 
850 


gtc 
Val 


ate 
He 


tac 
Tyr 


2659 


— 

U 

UJ 


gtt 
Val 

ate 
lie 
870 


gee 
Ala 
855 

aac 
Asn 


gat 
Asp 

aat 
Asn 


gtc 
Val 

acc 
Thr 


tat 
Tyr 

teg 
Ser 


cgt 
Arg 

tgg 
Trp 
875 


aag 
Lys 
8 60 

aaa 
Lys 


aat 
Asn 

act 
Thr 


cct 
Pro 

aca 
Thr 


gac 
Asp 

gga 
Gly 


tgc 
Cys 

acg 
Thr 
880 


acg 
Thr 
865 

aat 
Asn 


aca 
Thr 

etc 
Leu 


get 
Ala 

tea 
Ser 


etc 
Leu 

aga 
Arg 


eta 
Leu 

caa 
Gin 
885 


2707 
2755 


RJ 

*ssl 


get 
Ala 


ggt 
Gly 


ate 
He 


gga 
Gly 


aga 
Arg 
890 


gca 
Ala 


ggg 

Gly 


ate 
He 


ttt 
Phe 


tat 
Tyr 
895 


gec 
Ala 


ttc 
Phe 


tct 
Ser 


cca 
Pro 


aat 
Asn 
900 


ctt 
Leu 


2803 


CO 


gag 
Glu 


gtc 
Val 


aca 
Thr 


agt 
Ser 
905 


aac 
Asn 


eta 
Leu 


tct 
Ser 


atg 
Met 


gaa 
Glu 
910 


att 
He 


cgt 
Arg 


gga 
Gly 


tct 
Ser 


tea 
Ser 
915 


cgc 
Arg 


age 
Ser 


2851 




tac 
Tyr 


aat 
Asn 


gca 
Ala 
920 


gat 
Asp 


ctt 
Leu 


gga 
Gly 


ggt 
Gly 


aag 
Lys 
925 


ttc 
Phe 


cag 
Gin 


ttc 
Phe 


taaaagcgtt cctgatccct 


2904 



tagaaattct aagagatcct gagtgtatct agggacttct caaaga 2950 
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Figure 2 

Restriction enzyme analysis of CPN100638 



Tsp509I 

Hinf I Bf al Msel I 

Tfil Msel Spell BsmAI Mnll I I CviRI 

I I I I I IN I 

TGTAGATTCTTAACTTACTAGTCTCTCCTTTCCTCTTGCTTTCTTTAATTTATTGCAGTA 

x + + + + + + 

ACATCTAAGAATTGAATGATCAGAGAGGAAAGGAGAACGAAAGAAATTAAATAACGTCAT 



60 



MboII 

Msel CviJI Mnll Mnll 

Tsp509I HphI 1 Sfcl I Nlalll I Earl | 

III II II II 

TGTGGTGAAATAATTTGTTAAACCACCTATAGCCCTCTACATGAAATCCTCTCTTCATTG 

+ + h + + + 

ACACCACTTTATTAAACAATTTGGTGGATATCGGGAGATGTACTTTAGGAGAGAAGTAAC 

Bbvl 
Apol I 
Tsp509I I Fnu4HI 
Maelll I f MwoII 

Msel Tsp45I I I Tsell 

I III M 

GTTTTTAATCTCGTCATCTTTAGCACTTCCCTTGTCACTAAATTTCTCTGCGTTTGCTGC 

+ + + + + + 

CAAAAAT TAGAGCAGTAGAAATCG TGAAGGG AACAGTGAT T TAAAGAGACGCAAACGACG 

NlalV 
BseRI 
Bpml 
ScrFI 
EcoRII 



61 



121 



120 



180 



Avail 
Eco0109l 
PspSII 
Sau96I 
Sse8647I 
Bfal I 
I 



Avail 
BspGI 
Sau96l 
Hpyl78III| 
Alul \ I 

CviJI | | 



BsaXI 
Alol | 
Ppill 
I I 



181 



TGTTGTTGAAATCAATCTAGGACCTACCAATAGCTTCTCTGGACCAGGAACCTACACTCC 

+ + + +■ + + 

ACAACAACTTTAGTTAGATCCTGGATGGTTATCGAAGAGACCTGGTCCTTGGATGTGAGG 



240 



Mnll 
CviJI J 



Bed 
Tthlllll | 

CviRI I I HphI BsmAI 

li III II 

TCCAGCCCAAACAACAAATGCAGATGGAACTATCTATAATCTAACAGGGGATGTCTCAAT 

241 + + + + + + 

AGG T CGGGTTTGTTGTT T AC G T CT ACCT T GAT AG AT AT TAG AT T GT C CCC T AC AG AG T T A 

Alul 
CviJI 
Alwl 
Hpyl88IX 
Dpnl | 
BstYI | || | Acelll 
Sau3AI I || I Acil I 
Fokl CviRI II II I Mwol I I 
I III II I II I 
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300 



Dral 
Msel I 
Mmel | I 
I I I 



BsrI 
Hinfl 
Tfil 
Bmrl | 
I I 
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CACCAATGCAGGATCTCCGACAGCTCTAACCGCTTCCTGCTTTAAAGAAACTACTGGGAA 



301 + + + + + + 360 

GTGGTTACGTCCTAGAGGCTGTCGAGATTGGCGAAGGACGAAATTTCTTTGATGACCCTT 



BsaJI 
BstDSI 
Cvi JI | 
Hael I 
Haelll | 
BsaJI | | Tsp509I 

Styl I ICviJI I 

I I I 



Clal 
TaqI 
Faul 
Sthl32I| 
SfaNI | | 

Bcefl | I I 

II II 



Acil 
I 



361 



TCTTTCTTTCCAAGGCCACGGCTACCAATTTCTCCTACAAAATATCGATGCGGGAGCGAA 

+ + + + + + 

AGAAAGAAAGGTTCCGGTGCCGATGGTTAAAGAGGATGTTTTATAGCTACGCCCTCGCTT 

Alul 
CviJI 
Hindlll 
CviRI 
Fnu4HI 
Alul I 
CviJI | 

Bbvl MspAlII I I I Hinfl 

Rsal I Pvulll I I I Tfil 

Taal 1 t Tsell I I I Hpyl78III I 

III II I I I II I 

CTGTACCTTTACCAATACAGCTGCAAATAAGCTTCTCTCCTTTTCAGGATTCTCCTATTT 

+ + -+ + + + 

GACATGGAAATGGTTATGTCGACGTTTATTCGAAGAGAGGAAAAGTCCTAAGAGGATAAA 



420 



Maelll 
Tsp45l 



421 



480 



Bed 

CviJI | Alul 
Bsral NlalVI | CviJI . 

I III I 

G TC AC T AAT ACAAAC CACG AAT G CT AC CAC AGG AACAGG AG CCAT CAAGT CC AC AGGAGC 

481 + + + + + + 540 

CAGTGATTATGTTTGGTGCTTACGATGGTGTCCTTGTCCTCGGTAGTTCAGGTGTCCTCG 

CviJI 
Hael 
Haelll 
MscI 
Eael I 
Mwoll | 

TaqI Sfcl CjePI II I Mnll 

II I I I I i 

TTGTTCTATTCAGTCGAACTATAGTTGCTACTTTGGCCAAAACTTTTCTAATGACAATGG 



541 



-+ 600 



AACAAGATAAGTCAGCTTGATATCAACGATGAAACCGGTTTTGAAAAGATTACTGTTACC 



BsaJI 
Haell 
Hhal 



Hin4I | 
NlalVI 
BsaHIl | 
Narl I I 
BanI I I I 
CjePI I I I 
I I I I 



Styl 



Alul 
CviJI 
Mnll I 
Fnu4HI | | 
Tsell I I 
M I I 



Acelll 
Bbvl! 
BplI I I 
I I I 



CjePI 



Acll 
Maell 



AGGCGCCCTCCAAGGCAGCTCTATCAGTCTATCGCTAAACCCCAACCTAACGTTTGCCAA 



601 



-+ 660 



TCCGCGGGAGGTTCCGTCGAGATAGTCAGATAGCGATTTGGGGTTGGATTGCAAACGGTT 
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Sthl32I 
Bspl286I 
Bmgl 
BseSI 
NlalV) 
BanI ! | 
I ( I 



BslI 
BscGI 
Mnll 
BsaJI | 
BstDSI | 
Mnll I | 
I I I 



Msel 
Tsp509I I 
I t 



AAACAAAGCAACGCAAAAAGGGGGTGCCCTCTATTCCACGGGAGGGATTACAATTAACAA 

661 + + + + + + 720 

TTTGTTTCGTTGCGTTTTTCCCCCACGGGAGATAAGGTGCCCTCCCTAATGTTAATTGTT 

Fnu4HI 
SacII 
Taul 
Acil I 
MspAlII 

Thai I Cvi JI 

BseMII Acil I I NlalV! 

Hpyl88IX BsaJI I I Ecil I i 

SfaNII BstDSI M Acil I II 

II III MM 

TACGTTAAACTCAGCATCATTTTCTGAAAATACCGCGGCGAACAATGGCGGAGCCATTTA 

721 + + + + + + 780 

ATGCAATTTGAGTCGTAGTAAAAGACTTTTATGGCGCCGCTTGTTACCGCCTCGGTAAAT 



Msel 
Mae I I I Ddel 
I 



Cac8I 
Bfall 
Alul | [ 

CviJIM Fnu4HI Bbvl Alul Maelll 

NhelM Tsell Tsp509I CviJI Tsp4 5l 

Ml II II I 

CACGGAAGCTAGCAGTTTTATTAGCAGCAACAAAGCAATTAGCTTTATAAACAATAGTGT 

781 + + + + + + 840 

GTGCCTTCGATCGTCA/y^ATAATCGTCGTTGTTTCGTTAATCGAAATATTTGTTATCACA 



CviJI 
NlalV 
BseMII 
Mnll 
Cjel 
Sfcl 
Alul 
CviJI 
Mwol 
BbvCI | 

BpulOI | M II III Taal Rsal Cjel 

Acil Ddel | II | I Mi Sfcl I TatI I CviJI | BsrI 

I MUM III M II II I 

GACCGCAACCTCAGCTACAGGGGGAGCCATTTACTGTAGTAGTACATCAGCCCCCAAACC 

841 + + + + — + + 900 

CTGGCGTTGGAGTCGATGTCCCCCTCGGTAAATGACATCATCATGTAGTCGGGGGTTTGG 



Tsp509I Bfal 
Cjel I Spell 



Hpyl88IX 
Msel Sthl32l | BscGI 

I I I I I I II 

AGTCTTAACTCTATCAGACAACGGGGAACTGAACTTTATAGGAAATACAGCAATTACTAG 

901 + + + + + + 960 

TCAGAATTGAGATAGTCTGTTGCCCCTTGACTTGAAATATCCTTTATGTCGTTAATGATC 

Avail 
EcoO109I 
PspSlI 
Sau96I 
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Bpml 
Dral | 
Msel I I 



Sse8647I 
MboII Hpyl78lII J 
Cjel Bfalf Mnll | | 

I II III Ml 

TGGTGGGGCGATTTATACTGACAATCTAGTTCTTTCTTCTGGAGGACCTACGCTTTTTAA 

961 + + + + + + 1020 

ACCACCCCGCTAAATATGACTGTTAGATCAAGAAAGAAGACCTCCTGGATGCGAAAAATT 

Muni 
Tsp509I 
Acelll 
Bsu36I 
Ddel 
Mnll 
Alul 
CviJI 
PstI | 



CviJI 
I 



BsaXI 
Fnu4HI 
CviRI I 
Tsel | 
Sfcl | | 
I I 



Bbvl 
BslI 



HaelV 
BseRI Hin4I 
PlellHinfl I 
II I I 



1021 



AAACAACTCTGGCTATGATACTGCAGCTCCCTTAGGAGGAGCAATTGCGATTGCTGACTC 

+ + + + + + 1080 

TTTGTTGAGACCGATACTATGACGTCGAGGGAATCCTCCTCGTTAACGCTAACGACTGAG 



Dpnl 

BstYI | BsmAI 

Sau3AI I Alwl CviJI I 

Hpyl78III| I Hinfl Plel I I Taal 

Ml I III I 

TGGATCTTTGAGTCTTTCGGCTCTTGGTGGAGACATCACTTTTGAAGGAAACACAGTAGT 

1081 + + + + + + H40 

ACCTAGAAACTCAGAAAGCCGAGAACCACCTCTGTAGTGAAAACTTCCTTTGTGTCATCA 

Alul Hpyl88IX 
CviJI Hinfl I Apol Hpyl88IX 

MboII I TaqI I IPlel Tsp509l Msel I 

1 I I I I I I I I 

CAAAGGAGCTTCTTCGAGTCAGACCACTACCAGAAATTCTATTAACATCGGAAACACCAA 

1141 + + + + + + 1200 

GTTTCCTCGAAGAAGCTCAGTCTGGTGATGGTCTTTAAGATAATTGTAGCCTTTGTGGTT 



1201 



CviJI 
Cac8I 
Fnu4HI 
Alul | 
CviJI | 
MspAlI | 
PvuII t 
Bce83I | | 

Rsal II i I Dpnl 

BsrGI I If II Sau3AI I 

TatI I Tsel | I I Smll Mnll Alwl | I 

II II II I I t II 

TGCTAAGATTGTACAGCTGCGAGCCTCTCAAGGCAATACTATCTACTTCTATGATCCTAT 

+ + + + + + 1260 

ACGATTCTAACATGTCGACGCTCGGAGAGTTCCGTTATGATAGATGAAGATACTAGGATA 

Acelll 
Bbvl | 
Hpyl88IX | I 
Alul I I I 



Bbvl 
Ddel 
I 
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CviJI 
Pstll 
SfaNIi 
TspRI | 



Bfal 
I 



Fnu4HI 
SfaNI 
CviRI | 
Tsell 
BtsI | | 
Sfcl I I 
I ! I 



I Ddel 



Hpyl78III 
Msel Avail I 
BseMII |Sau96I | 
I I I ! 



CviRI 



Hinfl 
Tfil 
I 



CviRI 
I 



Bed 
Drdll I 
NlaIV| | 
I I I 



AACAACTAGCATCACTGCAGCTCTCTCAGATGCTCTAAACTTAAATGGTCCTGACCTTGC 

1261 + + + + + + 1320 

TTGTTGATCGTAGTGACGTCGAGAGAGTCTACGAGATTTGAATTTACCAGGACTGGAACG 

Hpyl88lX 
Bbvl 
Alul 
CviJI 
Hin4I | 

Hpyl78III Hin4I | | I I Bpml 
I I I I I t t 

AG GG AAT C CT GCAT ATC AAG G AAC CATC GT AT T T TC TG G AGAG AAGC T C T C GG AAG CAG A 

1321 + + + + + + 1380 

TCCCTTAGGACGTATAGTTCCTTGGTAGCATAAAAGACCTCTCTTCGAGAGCCTTCGTCT 

Alul 1 
AlwNI 
CviJI 
Mwol 
PstI 
CviRI 
BplI 
Fnu4HI 
Sfcl 
Alul I 
AlwNI | 
BplII 
BstAPI | 
CviJI | 
Mwol | 
Tsell 



1381 



Acil 
Faul Mnll I 
Tsp509I Sthl32l IMnll | | 

I I I I I I I I II M I 

AGCTGCAGAAGCTGATAATCTCAAATCTACAATTCAGCAACCTCTAACTCTTGCGGGAGG 

+ + + + + + 1440 

TCGACGTCTTCGACTATTAGAGTTTAGATGTTAAGTCGTTGGAGATTGAGAACGCCCTCC 



Hin4I 
I 



Maelll 
Tsp45I 
Hinfl I 
BplII | 
Hpyl78III | | | 
Msel | I | I 

I I I I I 



Bfal 
Plel 



CviJI 
BseRI 
Neil I 
ScrFI 1 
Mspl I I 



Ddel Sthl32I 
I I 

GCAACTCTCTCTTAAATCAGGAGTCACTCTAGTTGCTAAGTCCTTTTCGCAATCTCCGGG 

1441 + + + + + + 1500 

CGTTGAGAGAGAATTTAGTCCTCAGTGAGATCAACGATTCAGGAAAAGCGTTAGAGGCCC 

Fokl 
Cjel 
NlalV | 
Avail I I 
Sau96I| | 
CviRI | | | 

40 



Alwl 
BsaBI I 
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Mnll I | | I Dpnl 
Mnll | II I I MspAlI Sau3AI I 

Nlalll I I II II Acil | Bed | I 

BslI || I II II BsmFI | IBslII I I 
lilt M I I 111 Mil 

CTCTACCCTCCTCATGGATGCAGGGACCACATTAGAAACCGCTGATGGGATCACTATCAA 

1501 + + + + + + 1560 

GAGATGGGAGGAGTACCTACGTCCCTGGTGTAATCTTTGGCGACTACCCTAGTGATAGTT 



SfaNI 
Banll I 
Bspl286l | 
I I 



Cjel 
I 



Bsal 
Hinfl BsmAI 
Tfil Msel 
I I 



Bspl286I 
Cac8I 
Bmgl | 
BseSI i 
I ! 



Bsbl 
I 



T AAT CTT GT T CTCAAT GT AGATTCCT T AAAAGAGACCAAGAAGGGCACGCTAAAAGCAAC 

1561 + + + + + + 1620 

ATTAGAACAAGAGTTACATCTAAGGAATTTTCTCTGGTTCTTCCCGTGCGATTTTCGTTG 



Tthlllll 
Maelll 
Taal 
Tsp45I 
Hpyl88IX | 
Mwol Cjel I I 
I III 



Hpyl78III 
Dpnl 
BstYI 



Dpnl 
Sau3AI I 



Sau3AI 
Cjel | 
Alwl I I 



Hpyl78III| | Alwl | 



t 



I 



I I 
I I 



I I I 

ACAAGCAAGTCAGACAGTCACTTTATCTGGATCGCTCTCTCTTGTAGATCCTTCTGGAAA 

1621 + + + + + + 1680 

TGTTCGTTCAGTCTGTCAGTGAAATAGACCTAGCGAGAGAGAACATCTAGGAAGACCTTT 



Pflll08I 
AccI | 
CjePIl | 
I I I 



MboII 
BsmAI | 
Bce83I | | 
I I 



CjePI 
Smll | Mnll 

I I I 



BsmAI 



TGTCTACGAAGATGTCTCTTGGAATAACCCTCAAGTCTTTTCTTGTCTCACTCTTACTGC 



1681 



-+ 1740 



ACAGATGCTTCTACAGAGAACCTTATTGGGAGTTCAGAAAAGAACAGAGTGAGAATGACG 



Dpnl 
Sau3AI 
Fnu4HI 
Alull 
Alwl 1 
CviJI | 
Tsel) 

Bbvl Ddel I | II Bfai 

I I II II I 

TGACGACCCCGCGAATATTCACATCACAGACTTAGCTGCTGATCCCCTAGAAAAAAATCC 

1741 + + + + + + 1800 

ACTGCTGGGGCGCTTATAAGTGTAGTGTCTGAATCGACGACTAGGGGATCTTTTTTTAGG 



Faul 
Sspl 
Sthl32I| 
Thai I I 
SimI Acil I I I 
I i 



Tsp509I 

BsaJI I BciVI 

BciVI Styl | Mnll | 

I II II 

TATCCATTGGGGATACCAAGGGAATTGGGCATTATCTTGGCAAGAGGATACTGCGACTAA 

1801 + + + + + + I860 

ATAGGTAACCCCTATGGTTCCCTTAACCCGTAATAGAACCGTTCTCCTATGACGCTGATT 



Hinfl 



BspGI 



BpulOI 
Ddel 
Hpyl78lII 
Hinfl I 
Tfil I 
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Fnu4HI | ScrFI | Hgal I I 

Tsell I Bbvl | | Hpyl88IX I I 

Plelll I EcoRII | IBciVI BseMII I | I 

lit) III I lit I 

ATCCAAAGCAGCGACTCTTACCTGGACAAAAACAGGATACAATCCGAATCCTGAGCGTCG 

1861 + + + + + + 1920 

TAGGTTTCGTCGCTGAGAATGGACCTGTTTTTGTCCTATGTTAGGCTTAGGACTCGCAGC 



Cjel 
Ddell 
NlalV | f 
I I I 



Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
Alwl | 
I I 



Hhal 
Hin4I I 
Alwl Cjel | | 
I lit 



TGGAACCTTAGTTGCTAACACGCTATGGGGATCCTTTGTTGATGTGCGCTCCATACAACA 

1921 + + + + + + 1980 

ACCTTGGAATCAACGATTGTGCGATACCCCTAGGAAACAACTACACGCGAGGTATGTTGT 



Alul Bce83l Hpyl78lII 

CviJI CviJI I Rsal Smll I 

I til II 



Fnu4HI 
Taul 
Acil | 
Thai | 
I I 



Dpnl 
BstYI | 
Sau3AI | 
SfaNI | | 

I I I 



ffl 



GCTTGTAGCCACTAAAGTACGCCAATCTCAAGAAACTCGCGGCATCTGGTGTGAAGGGAT 

1981 + + + + + + 2040 

CGAACATCGGTGATTTCATGCGGTTAGAGTTCTTTGAGCGCCGTAGACCACACTTCCCTA 

XmnI 
Alwl 

Hpyl78lII | 

TaqI II Ms II 

MboIII ! | MboII BspMI | CviRI 

I I I I I III 

CTCGAACTTCTTCCATAAAGATAGCACGAAGATAAATAAAGGTTTTCGCCACATAAGTGC 

2041 + + + + + + 2100 

GAGCTTGAAGAAGGTATTTCTATCGTGCTTCTATTTATTTCCAAAAGCGGTGTATTCACG 



m 



CviJI 
PstI 
TspRI 
Fnu4HI 
CviRI | 

Hpyl88lX Tsell 
Alul I BtsI I I 

Bsgl CviJI | Sfcl I I 

I II I I I 

AGGTTATGTTGTAGGAGCGACTACAACATTAGCTTCTGATAATCTTATCACTGCAGCCTT 

2101 + + + + + + 2160 

TCCAATACAACATCCTCGCTGATGTTGTAATCGAAGACTATTAGAATAGTGACGTCGGAA 



Bbvl 
Tsp509I 
Sthl32I |Hpyl78III 
I I I 



Dpnl 
Sau3AI | 
BsaBIl | 

I I I 



Alul 
CviJI 



CTGCCAATTATTCGGGAAAGATAGAGATCACTTTATAAATAAAAATAGAGCTTCTGCCTA 

2161 + + + + + + 2220 

GACGGTTAATAAGCCCTTTCTATCTCTAGTGAAATATTTATTTTTATCTCGAAGACGGAT 



Alul 
CviJI 
Fnu4HI | 
Bpml | | 
CviRI | | 



Tthllll 
SfaNI | 
Bbslf | 



Mwol 
Maelll | 
Alul | f 
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I 



Mwoll I Bbvl MboIII I CviJI 

Tsell 1 Bed Bfal II I Hindlll 1 

III 1 I I I I I I I I 

TGCAGCTTCTCTCCATCTCCAGCATCTAGCGACCTTGTCTTCTCCAAGCTTGTTACGCTA 

2221 h ■ i ~ "i — — — — (- — 1 — — h 2280 

ACGTCGAAGAGAGGTAGAGGTCGTAGATCGCTGGAACAGAAGAGGTTCGAACAATGCGAT 

Sfcl 
BseMII 
Alul 
CviJI 
Dpnl 



Hpyl88IX 
Dpnl 
BstYI I 
Sau3AI | 
ScrFI I I 
EcoRII I I I lAlwI 
I I 



SfaNI 
CviJI I 
Fnu4HI | I 
Tsell I I 
I I I 



Bbvl 
I 



Sau3AI 
Hpyl88lX| 
Ddel | | 
Mnlll II 
I I II 



CCTTCCTGGATCTGAAAGTGAGCAGCCTGTCCTCTTTGATGCTCAGATCAGCTATATCTA 



2281 



-+ 2340 



GGAAGGACCTAGACTTTCACTCGTCGGACAGGAGAAACTACGAGTCTAGTCGATATAGAT 

Banll 
BsiHKAI 
Bspl286I 
SacI 
BssSI | 
Alul | 
CviJI | 
Tthlllll II 
CjePI Taqll I I I 
I I I I I 

TAGTAAAAATACTATGAAAACCTATTACACCCAAGCACCAAAGGGAGAGAGCTCGTGGTA 



2341 



- + 2400 



ATCATTTTTATGATACTTTTGGATAATGTGGGTTCGTGGTTTCCCTCTCTCGAGCACCAT 



Hpyl78III 
Hhal I 
CjePI I I 
Taal I I I 
I I I 



Banll 
- BsiHKAI 
Bspl286I 
SacI 
Alul | 
CviJI I 
Cac8I I I 



Nlalll 
MboII 
CviJI 
Mnll 



2401 



Msel I 
TspRI | I 
BtsI I I I 

III I II I 

TAATGACGGTTGCGCTCTGGAACTTGCGAGCTCCCTACCACACACTGCTTTAAGCCATGA 

+ + + + + + 2460 

ATTACTGCCAACGCGAGACCTTGAACGCTCGAGGGATGGTGTGTGACGAAATTCGGTACT 



Thai 
AflHI 
Mlul 
Earl I 
Bsal I I 
SimI BsmAI I I 
1 



2461 



Alul 
CviJI 
Hindlll I Rsal 
I I I 

GGGTCTCTTCCACGCGTATTTTCCTTTCATCAAAGTAGAAGCTTCGTACATACACCAAGA 

+ + + + + + 2520 

CCCAG AG AAGGT GC GCAT AAAAG G AAAG T AG T T T C AT CT TC GAAG CAT G T AT GT GG T T CT 



Alul 
CviJI 



Maell 



BsaJI 
Styl 



Dpnl 
Sau3AI I 
Rsal I I 
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Msel 
Pad 
Tsp509l 
Msel | 



Maell 
Hphll 



I I 
1 I 
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2521 



2581 



| | I I I I I I Mill 

TAGCTTCAAAGAACGTAATACTACCTTGGTACGATCTTTCGATAGCGGTGATTTAATTAA 

+ + + + + + 2580 

AT CGAAGTTT CTTGCAT TATGATGGAACCATGCT AGAAAGCTAT CGCCACTAAATT AAT T 

Hgal 
Hpyl78III 
Hpyl78III 
TaqI 
Avail 
Smll | 
Hin4I I I 
Hinfl Mil BplI 
BsmAI Hpyl78III |XhoI| I I Cac8I| 

BsmBI Tsp509I TaqI ITfil Mil Hin4I| 

I | | | I 1 I I I II 

CGTCTCTGTGCCTATTGGAATTACCTTCGAGAGATTCTCGAGAAACGAGCGTGCGTCTTA 

+ + + + + + 2640 

GCAGAGACACGGATAACCTTAATGGAAGCTCTCTAAGAGCTCTTTGCTCGCACGCAGAAT 



Hpyl78III 

Alul Bs 9 J Hinfl I Alul 

CviJI Taal Maell BsaBI I Tfil I CviRI CviJI 

II I III!) I 

CGAAGCTACTGTCATCTACGTTGCCGATGTCTATCGTAAGAATCCTGACTGCACGACAGC 



2641 



+ 2700 



GCTTCGATGACAGTAGATGCAACGGCTACAGATAGCATTCTTAGGACTGACGTGCTGTCG 



Acelll 



BSSSI 



Bce83I 
Sfcl I 
Mnll I I 



Hpyl78III 
I 



Hinfl I Alul 

Tfil Smll | CviJI 

| | I II I I I I 

TCTCCTAATCAACAATACCTCGTGGAAAACTACAGGAACGAATCTCTCAAGACAAGCTGG 



2701 



+ 2760 



AGAGGATTAGTTGTTATGGAGCACCTTTTGATGTCCTTGCTTAGAGAGTTCTGTTCGACC 



Hpyl88IX 
Earl | 
Sapl I 



2761 



Maelll 
Tsp45I 
Hpyl78III I 
Smll | I 
Mnll I I I Maelll 

I II I I 
TATCGGAAGAGCAGGGATCTTTTATGCCTTCTCTCCAAATCTTGAGGTCACAAGTAACCT 

+ + „+ + + + 2820 

AT AG CCTTCTCGTCC CT AG AAAAT AC GG AAGAG AGG T T TAG AACT CC AG T G T T CAT T GG A 



MboII 
Dpnl | 
BstYI | | 
Sau3AI | I 
Hin4I | | | Alwl 
till 



Tsp509I 
BslI I 
Bce83I | I 
I I I 



Dpnl 
BstYI 
Sau3AI 
MboII 
Apol 



Alul 
CviJI 
Fnu4HI I 
Tsell I 
Alwl I I I 
I I I I 



Dpnl 
Mnll I 
BglHl 1 
BstYI I | 
Sau3AI | | 
Bbvl I I 1 
CviRI MM 
I I I I I 



BsrI 



2821 



ATCTATGGAAATTCGTGGATCTTCACGCAGCTACAATGCAGATCTTGGAGGTAAGTTCCA 

______ 4______— -i | — — -f — — — _— —-f- 2880 

TAGATACCTTTAAGCACCTAGAAGTGCGTCGATGTTACGTCTAGAACCTCCATTCAAGGT 

Ddel 
Hpyl78III 
Dpnl 1 
BstYI I I 
Sau3AI I I 
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Dpnl BseMII 

Sau3AI I Apol I 

Hpyl78IIH I Tsp509I Alwl | 

Alwl II IDdel I Ddell ! I I Bfal 

I III I I M 

GTTCTAAAAGCGTTCCTGATCCCTTAGAAATTCTAAGAGATCCTGAGTGTATCTAGGGAC 



2881 



+ 2940 



CAAGATTTTCGCAAGGACTAGGGAATCTTTAAGATTCTCTAGGACTCACATAGATCCCTG 



BsmFI 
I 

TTCTCAAAGA 

2941 + 2950 

AAGAGTTTCT 
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